Soon, you can upload video on Gemini app and ask questions about it: Report

Google's Gemini app is reportedly getting a new capability that will let users upload videos in the chat and get an analysis done on them. So far, users have been able to upload pictures and documents for the artificial intelligence chatbot to analyse or scan. However, soon they will also be able to upload videos in the prompt. According to a report by 9To5Google, Gemini will analyse the video and let users ask questions about it.
9To5Google tested the feature by sharing a video and asking the AI bot to describe the video, which it did pretty accurately. It is to be noted that Gemini video upload feature has not yet been rolled out widely. The availability of the feature varies depending on accounts/devices that 9To5Google checked. However, this feature will reportedly be made available to both free and paid users across Android (Google app 16.23 beta) and iOS, as well as 2.5 Flash and 2.5 Pro. The feature is not live on the web interface yet.
Video in Gemini: How to use
Open the plus (+) menu to upload a file.
Select Gallery or Files from the options.
If video upload is available for your account, you'll be able to select video files.
If not, video files will appear grayed out and cannot be uploaded.
In other related news, Google officially rolled out its Gemini 2.5 series of AI models on Tuesday, making them widely accessible. As part of the launch, users can now interact with the stable releases of both Gemini 2.5 Pro and Gemini 2.5 Flash. The tech giant has also extended access to the Pro model for users on the free tier of the Gemini platform. Alongside these, Google introduced Gemini 2.5 Flash-Lite — touted as the company's fastest and most cost-effective AI model to date.

Hashtags

Finance

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Google Pixel 10 series may feature dual speakers and SIM tray shift, leak hints

Mint

30 minutes ago

Mint

Google Pixel 10 series may feature dual speakers and SIM tray shift, leak hints

Google's next-generation flagship smartphone lineup, expected to include the Pixel 10, Pixel 10 Pro, Pixel 10 Pro XL, and the foldable Pixel 10 Pro Fold, has been the subject of numerous leaks in recent weeks. Now, a newly surfaced report offers fresh insights into the design of the Pixel 10 and Pixel 10 Pro—this time via an alleged protective case leak. According to Android Authority, the protective case comes from Thinborne, a well-known accessory maker. The company has reportedly developed cases for the Pixel 10 and Pixel 10 Pro ahead of their official unveiling. The leaked case design suggests that while the upcoming phones may appear similar to last year's Pixel 9 and Pixel 9 Pro, there are subtle shifts in dimensions and hardware placements that indicate noteworthy updates. The report claims that the Pixel 9 Pro can fit snugly into the case made for the Pixel 10, hinting at comparable sizing between the generations. However, the alignment of key elements such as port cutouts and the camera bar tells a different story. Visuals of the leaked case suggest that the USB-C port and other bottom-edge components have been repositioned. Unlike the single-speaker cutout seen in previous models, the Pixel 10 and Pixel 10 Pro are expected to feature dual-speaker cutouts on the underside of the device. Additionally, the SIM tray, traditionally located on the side or bottom, appears to have been moved to the top edge in the new design. There are further indications that the rear camera system has been slightly revamped. The case does not align perfectly with the camera bar of the Pixel 9 Pro, leading to speculation that the Pixel 10's camera module will be marginally larger. The case also partially obscures the 5G antenna on the older model, reinforcing the theory that external hardware elements have shifted enough to warrant a redesign. These changes, though subtle, could mean existing Pixel 9 cases will not be compatible with the Pixel 10, despite the superficial resemblance between the two generations.

Google Veo 3: Creative Breakthrough or Crisis for Journalism?

The Hindu

an hour ago

The Hindu

Google Veo 3: Creative Breakthrough or Crisis for Journalism?

Published : Jun 18, 2025 18:51 IST - 5 MINS READ Launched in May 2025 at Google's annual I/O developer conference, Google Veo 3 is the tech giant's direct challenge to Microsoft-backed OpenAI's video generation model, Sora. Developed by Google DeepMind, the advanced model marks a major leap in generative AI, promising high-quality, realistic video creation from text or image prompts. But in an age flooded with misinformation and deepfakes, a tool like Veo 3—with its ability to produce lifelike video and synchronised audio—raises pressing questions for journalism. It opens new creative possibilities, yes, but also invites serious challenges around credibility, misuse, and editorial control. What is Google Veo? Veo 3 touts itself as a 'cutting-edge tool' offering 'unmatched realism, audio integration, and creative control'. It comes at a high price—$249.99/month under the AI Ultra plan—and is currently available in the US and 71 other countries, excluding India, the EU, and the UK. Ethical concerns loom, but Google pitches Veo as a powerful resource for filmmakers, marketers, and developers. According to Google, Veo 3 can generate 4K videos with realistic physics, human expressions, and cinematic style. Unlike many competitors, it also produces synchronised audio—dialogue, ambient noise, background music—adding to the illusion of realism. Also Read | When AI breaks the law, who gets arrested—the bot or its maker? The model is designed to follow complex prompts with precision, capturing detailed scenes, moods, and camera movements. Users can specify cinematic techniques like drone shots or close-ups, and control framing, transitions, and object movement. A feature called 'Ingredients' allows users to generate individual elements—like characters or props—and combine them into coherent scenes. Veo can also extend scenes beyond the frame, modify objects, and maintain visual consistency with shadows and spatial logic. Google's website features examples of Veo in action, including projects in marketing, social media, and enterprise applications. The Oscar-nominated filmmaker Darren Aronofsky used it to create a short film, Primordial Soup. On social media, AI artists have released viral Veo clips like Influenders, a satire featuring influencers at the end of the world. Veo 3 is integrated into Google's AI filmmaking tool Flow, which allows intuitive prompting. Enterprise access is available via Vertex AI, while general users in supported countries can use it through Google's Gemini chatbot. The journalism dilemma Veo's features raise alarms about potential misuse. It could facilitate the creation of deepfakes and false narratives, further eroding trust in online content. There are also broader concerns about its economic impact on creators, legal liabilities, and the need for stronger regulation. The risks are not theoretical. As highlighted in a June 2025 TIME article, titled 'Google's Veo 3 Can Make Deepfakes of Riots, Election Fraud, Conflict', Veo was used to generate realistic footage of fabricated events—like a mob torching a temple or an election official shredding ballots—paired with false captions designed to incite unrest. Such videos could spread rapidly, with real-world consequences. Cybersecurity threats—like impersonating executives to steal data—are also plausible, alongside looming copyright issues. TIME reported that Veo may have been trained on copyrighted material, exposing Google to lawsuits. Meanwhile, Reddit forums cite personal harms, such as a student jailed after AI-generated images were falsely attributed to them. There is also the threat to livelihoods. AI-generated content could displace human creators, particularly YouTubers and freelance editors, accelerating what some call the 'dead internet'—a space overrun by AI-generated junk media. To mitigate risk, Google claims that all Veo content includes an invisible SynthID watermark, with a visible one in most videos (though it can be cropped or altered). A detection tool for SynthID is in testing. Harmful or misleading prompts are blocked, but troubling content has still emerged, highlighting the limits of guardrails. What should newsrooms do? Despite the risks, Veo presents compelling opportunities for journalism—particularly for data visualisation, explainer videos, recreating historical events, or reporting on under-documented stories. It can help small newsrooms produce professional-quality videos quickly and affordably, even for breaking news. Used responsibly, Veo could improve storytelling—turning eyewitness accounts of a disaster into a visual narrative, for instance, or transforming dry data into cinematic sequences. Prototyping ideas before committing to full production becomes more feasible, especially for digital-first outlets. But Veo's strengths are also its dangers. Its ability to produce convincing footage of events that never happened could destabilise the information ecosystem. If deepfakes flood the news cycle, real footage may lose credibility. The visible watermark is easily removed, and Google's SynthID Detector remains limited in scope, giving malicious actors room to operate undetected. To maintain public trust, newsrooms must clearly disclose when content is AI-generated. Yet the temptation to pass off fabricated visuals as real—especially in competitive, high-pressure news environments—will be strong. And because AI outputs reflect their training data, biases could sneak in, requiring rigorous editorial scrutiny. There is also the human cost. Veo's automation could eliminate roles for video editors, animators, and field videographers, especially in resource-strapped newsrooms. Journalists may need to learn prompt engineering and AI verification just to stay afloat. Also Read | AI is changing work, privacy, and power—what comes next? The legal landscape is murky. If an outlet publishes an AI-generated video that causes harm, accountability is unclear. Ownership of Veo-generated content also remains opaque, raising potential copyright disputes. And then there is the burden of verification. Fact-checkers will face a deluge of synthetic content, while reporters may find their own footage treated with suspicion. As the Pew Research Center reported in 2024, three in five American adults were already uneasy about AI in the newsroom. A critical juncture As Veo and tools like it become cheaper and more widely available, their impact on journalism will deepen. The challenge is not simply to resist the tide but to adapt—ethically, strategically, and urgently. According to experts, newsrooms must invest in training, transparency, and detection tools to reap the creative rewards of AI while safeguarding credibility. Innovation and trust must evolve together. If journalism is to survive this next phase of disruption, it must do so with eyes wide open, they say. (Research by Abhinav Chakraborty)

Google rolls out budget-friendly Gemini 2.5 Flash Lite, opens 2.5 Flash and Pro to all

India Today

2 hours ago

India Today

Google rolls out budget-friendly Gemini 2.5 Flash Lite, opens 2.5 Flash and Pro to all

Google has introduced a new addition to its Gemini AI model line-up — the Gemini 2.5 Flash-Lite. According to Google, this new AI model can deliver high performance at the lowest cost and fastest speeds yet. Alongside the new model, the company has announced the general availability of the Gemini 2.5 Flash and Pro models to all says that Gemini 2.5 Flash-Lite is its most affordable and fastest model in the 2.5 family. It has been built to handle large volumes of latency-sensitive tasks such as translation, classification, and reasoning at a lower computational cost. Compared to its predecessor, 2.0 Flash-Lite, the new model is said to deliver improved accuracy and quality across coding, maths, science, reasoning, and multimodal benchmarks. 'It excels at high-volume, latency-sensitive tasks like translation and classification, with lower latency than 2.0 Flash-Lite and 2.0 Flash on a broad sample of prompts,' says Google. advertisementGoogle highlights that despite being lightweight, 2.5 Flash-Lite comes with a full suite of advanced capabilities. These include support for multimodal inputs, a 1 million-token context window, integration with tools like Google Search and code execution, and the flexibility to modulate computational thinking based on budget. According to the company, these features make the Gemini 2.5 Flash-Lite ideal for developers looking to balance efficiency with robust AI 2.5 Flash-Lite availability The Gemini 2.5 Flash-Lite model is currently available in preview via Google AI Studio and Vertex AI. Google has also integrated customised versions of 2.5 Flash-Lite and Flash into its core products like Search, expanding their reach beyond developers to everyday 2.5 Flash and Pro models now available to allIn addition to introducing Flash-Lite, Google has also announced that its Gemini 2.5 Flash and Gemini 2.5 Pro models are now stable and generally available. These models were previously accessible to a select group of developers and organisations for early production to Google, companies like Snap, SmartBear, and creative tools provider Spline have already integrated these models into their workflows with encouraging results. Now that Flash and Pro are fully open, developers can use them in production-grade applications with greater the stable and preview models can be accessed through Google AI Studio, Vertex AI, and the Gemini app.