10-07-2025
- Business
- Business Standard
Google focuses on India's farms and languages in new AI initiatives
Google launched open-source AI initiatives on Thursday targeting India's agriculture sector and cultural representation in artificial intelligence models. The tech giant introduced its Agricultural Monitoring and Event Detection (AMED) API, which tracks crop and field data across India to help developers create farming productivity tools. Researchers at Google DeepMind also partnered with IIT-Kharagpur through the company's Amplify Initiative to build datasets capturing India's linguistic and cultural diversity for integration into large language models.
These developments build on Google's sustained investments and commitment to AI research that drives real-world impact across critical areas while also supporting India's AI-focused ambitions.
"We've been inspired by the solutions India's innovators have unlocked with these capabilities, demonstrating AI to be a powerful catalyst for multiplier impact and unprecedented effectiveness,' said Dr. Manish Gupta, senior director for India and APAC at Google DeepMind, during a roundtable in Bengaluru.
Google DeepMind and the Partnerships Innovation team have developed the AMED API to improve agricultural monitoring across India. Building on the company's Agricultural Landscape Understanding (ALU) API, the new tool uses machine learning, crop labels, and satellite imagery to identify crop types, field sizes, and sowing and harvesting dates. It also offers three years of historical data to track agricultural activity at the field level.
These insights aim to help develop AI-driven solutions that improve farm management by addressing crop-specific needs such as soil, water, growth patterns, and climate, while also forecasting harvest volumes.
Alok Talekar, Lead of Agriculture and Sustainability Research at Google DeepMind, said the firm is working on accelerating crucial shifts, transforming broad insights into granular, real-time data. 'So that increasingly impactful solutions not only translate into benefits for India's farmers but also bolster the nation against rising climate risks,' said Talekar.
TerraStack, a startup incubated at IIT-Bombay, has used the ALU API to build a rural land intelligence system. The aim is to support rural lending, land record modernization, and determine the vulnerability of farms to climate risk. It is exploring the AMED API for a rural lending use case.
'These APIs are helping standardize and transform previously unorganised and unusable data into solutions for one of India's most critical sectors,' said Aaryan Dangi, co-founder and CEO of TerraStack.
Linguistic Diversity
Google's Amplify Initiative seeks to improve large language models by incorporating localized data—including regional languages, dialects, and cultural nuances—missing from current AI training. Partnering with IIT-Kharagpur, the project will develop high-quality, hyperlocal datasets capturing India's linguistic diversity.
The open-source datasets aim to help developers create AI tools that better serve Indian language users. Data collection follows a community-driven, expert-vetted process to ensure responsible handling and reduce bias.
After a pilot in Sub-Saharan Africa producing 8,000 annotated queries across seven languages, the India phase will focus on healthcare and safety topics in multiple Indic languages.
'We are meticulously building the rich, hyperlocal context and cultural understanding that transforms raw information into profound knowledge,' said Madhurima Maji, lead program manager for the Amplify Initiative for India at Google.
Dr. Mainack Mandal, Assistant Professor at IIT Kharagpur, said the collaboration opens a new chapter in global AI development.
The Amplify Initiative builds on Google's broader push to improve Indian language and cultural representation in AI, alongside its flagship Project Vaani. Developed with the Indian Institute of Science (IISc) Bangalore, Project Vaani has released its second-phase Indic speech data through Bhashini and Hugging Face.
So far, the initiative has contributed nearly 21,500 hours of speech audio and 835 hours of transcribed data across 86 languages, collected from over 112,000 speakers in 120 districts. The open-source data aims to support AI tools tailored to India's linguistic diversity.
'This support fuels our continued investments in language and culture research, and drives us to make our foundational models, on which India is building its AI ambition, more effective and efficient in processing Indian languages,' said Dr. Partha Talukdar, language research lead at Google DeepMind.
Google said its AI models are being used across sectors in India, from improving maternal health programs and streamlining patient care to supporting agri-tech solutions and advancing the country's sovereign AI efforts.
The AlphaFold Protein Structure Database—developed by Google DeepMind and now used by over 150,000 researchers in India—is aiding work on complex diseases such as cancer and autoimmune disorders.
With a focus on collaboration and ecosystem-driven innovation, Google said it aims to drive broad, real-world impact through AI in India.