Tuesday, July 1, 2025
Social icon element need JNews Essential plugin to be activated.
No Result
View All Result
Digital Currency Pulse
  • Home
  • Crypto/Coins
  • NFT
  • AI
  • Blockchain
  • Metaverse
  • Web3
  • Exchanges
  • DeFi
  • Scam Alert
  • Analysis
Crypto Marketcap
Digital Currency Pulse
  • Home
  • Crypto/Coins
  • NFT
  • AI
  • Blockchain
  • Metaverse
  • Web3
  • Exchanges
  • DeFi
  • Scam Alert
  • Analysis
No Result
View All Result
Digital Currency Pulse
No Result
View All Result

Experiment with Gemini 2.0 Flash native image generation

March 14, 2025
in Artificial Intelligence
Reading Time: 3 mins read
A A
0

[ad_1]

In December we first launched native picture output in Gemini 2.0 Flash to trusted testers. At present, we’re making it accessible for developer experimentation throughout all areas presently supported by Google AI Studio. You’ll be able to take a look at this new functionality utilizing an experimental model of Gemini 2.0 Flash (gemini-2.0-flash-exp) in Google AI Studio and through the Gemini API.

Gemini 2.0 Flash combines multimodal enter, enhanced reasoning, and pure language understanding to create pictures.

Listed below are some examples of the place 2.0 Flash’s multimodal outputs shine:

1. Textual content and pictures collectively

Use Gemini 2.0 Flash to inform a narrative and it’ll illustrate it with photos, holding the characters and settings constant all through. Give it suggestions and the mannequin will retell the story or change the type of its drawings.

Sorry, your browser would not assist playback for this video

Story and illustration technology in Google AI Studio

2. Conversational picture modifying

Gemini 2.0 Flash helps you edit pictures by means of many turns of a pure language dialogue, nice for iterating in the direction of an ideal picture, or to discover completely different concepts collectively.

Sorry, your browser would not assist playback for this video

Multi-turn dialog picture modifying sustaining context all through the dialog in Google AI Studio

3. World understanding

In contrast to many different picture technology fashions, Gemini 2.0 Flash leverages world information and enhanced reasoning to create the suitable picture. This makes it good for creating detailed imagery that’s practical–like illustrating a recipe. Whereas it strives for accuracy, like all language fashions, its information is broad and basic, not absolute or full.

Sorry, your browser would not assist playback for this video

Interleaved textual content and picture output for a recipe in Google AI Studio

4. Textual content rendering

Most picture technology fashions battle to precisely render lengthy sequences of textual content, typically leading to poorly formatted or illegible characters, or misspellings. Inner benchmarks present that 2.0 Flash has stronger rendering in comparison with main aggressive fashions, and nice for creating ads, social posts, and even invites.

Sorry, your browser would not assist playback for this video

Picture outputs with lengthy textual content rendering in Google AI Studio

Begin making pictures with Gemini at this time

Get began with Gemini 2.0 Flash through the Gemini API. Learn extra about picture technology in our docs.

from google import genai
from google.genai import sorts

consumer = genai.Shopper(api_key=“GEMINI_API_KEY”)

response = consumer.fashions.generate_content(
mannequin=“gemini-2.0-flash-exp”,
contents=(
“Generate a narrative a couple of cute child turtle in a 3d digital artwork type. “
“For every scene, generate a picture.”
),
config=sorts.GenerateContentConfig(
response_modalities=[“Text”, “Image”]
),
)

Whether or not you might be constructing AI brokers, creating apps with lovely visuals like illustrated interactive tales, or brainstorming visible concepts in dialog, Gemini 2.0 Flash means that you can add textual content and picture technology with only a single mannequin. We’re desperate to see what builders create with native picture output and your suggestions will assist us finalize a production-ready model quickly.

[ad_2]

Source link

Tags: ExperimentFlashGeminiGenerationImageNative
Previous Post

Starknet Aims to Bridge Bitcoin and Ethereum

Next Post

Gemini Robotics brings AI into the physical world

Next Post
Gemini Robotics brings AI into the physical world

Gemini Robotics brings AI into the physical world

Binance Will Launch Viction (VIC) on Future Market

Binance Will Launch Viction (VIC) on Future Market

MEXC Ventures Advances Stablecoin Adoption with $36M Investment in Ethena and USDe

MEXC Ventures Advances Stablecoin Adoption with $36M Investment in Ethena and USDe

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Social icon element need JNews Essential plugin to be activated.

CATEGORIES

  • Analysis
  • Artificial Intelligence
  • Blockchain
  • Crypto/Coins
  • DeFi
  • Exchanges
  • Metaverse
  • NFT
  • Scam Alert
  • Web3
No Result
View All Result

SITEMAP

  • About us
  • Disclaimer
  • DMCA
  • Privacy Policy
  • Terms and Conditions
  • Cookie Privacy Policy
  • Contact us

Copyright © 2024 Digital Currency Pulse.
Digital Currency Pulse is not responsible for the content of external sites.

No Result
View All Result
  • Home
  • Crypto/Coins
  • NFT
  • AI
  • Blockchain
  • Metaverse
  • Web3
  • Exchanges
  • DeFi
  • Scam Alert
  • Analysis
Crypto Marketcap

Copyright © 2024 Digital Currency Pulse.
Digital Currency Pulse is not responsible for the content of external sites.