logo
Google's AlphaEvolve: The AI agent that reclaimed 0.7% of Google's compute – and how to copy it

Google's AlphaEvolve: The AI agent that reclaimed 0.7% of Google's compute – and how to copy it

Business Mayor17-05-2025

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More
Google's new AlphaEvolve shows what happens when an AI agent graduates from lab demo to production work, and you've got one of the most talented technology companies driving it.
Built by Google's DeepMind, the system autonomously rewrites critical code and already pays for itself inside Google. It shattered a 56-year-old record in matrix multiplication (the core of many machine learning workloads) and clawed back 0.7% of compute capacity across the company's global data centers.
Those headline feats matter, but the deeper lesson for enterprise tech leaders is how AlphaEvolve pulls them off. Its architecture – controller, fast-draft models, deep-thinking models, automated evaluators and versioned memory – illustrates the kind of production-grade plumbing that makes autonomous agents safe to deploy at scale.
Google's AI technology is arguably second to none. So the trick is figuring out how to learn from it, or even using it directly. Google says an Early Access Program is coming for academic partners and that 'broader availability' is being explored, but details are thin. Until then, AlphaEvolve is a best-practice template: If you want agents that touch high-value workloads, you'll need comparable orchestration, testing and guardrails.
Consider just the data center win. Google won't put a price tag on the reclaimed 0.7%, but its annual capex runs tens of billions of dollars. Even a rough estimate puts the savings in the hundreds of millions annually— enough, as independent developer Sam Witteveen noted on our recent podcast, to pay for training one of the flagship Gemini models, estimated to cost upwards of $191 million for a version like Gemini Ultra.
VentureBeat was the first to report about the AlphaEvolve news earlier this week. Now we'll go deeper: how the system works, where the engineering bar really sits and the concrete steps enterprises can take to build (or buy) something comparable.
AlphaEvolve runs on what is best described as an agent operating system – a distributed, asynchronous pipeline built for continuous improvement at scale. Its core pieces are a controller, a pair of large language models (Gemini Flash for breadth; Gemini Pro for depth), a versioned program-memory database and a fleet of evaluator workers, all tuned for high throughput rather than just low latency.
A high-level overview of the AlphaEvolve agent structure. Source: AlphaEvolve paper.
This architecture isn't conceptually new, but the execution is. 'It's just an unbelievably good execution,' Witteveen says.
The AlphaEvolve paper describes the orchestrator as an 'evolutionary algorithm that gradually develops programs that improve the score on the automated evaluation metrics' (p. 3); in short, an 'autonomous pipeline of LLMs whose task is to improve an algorithm by making direct changes to the code' (p. 1).
Takeaway for enterprises: If your agent plans include unsupervised runs on high-value tasks, plan for similar infrastructure: job queues, a versioned memory store, service-mesh tracing and secure sandboxing for any code the agent produces.
A key element of AlphaEvolve is its rigorous evaluation framework. Every iteration proposed by the pair of LLMs is accepted or rejected based on a user-supplied 'evaluate' function that returns machine-gradable metrics. This evaluation system begins with ultrafast unit-test checks on each proposed code change – simple, automatic tests (similar to the unit tests developers already write) that verify the snippet still compiles and produces the right answers on a handful of micro-inputs – before passing the survivors on to heavier benchmarks and LLM-generated reviews. This runs in parallel, so the search stays fast and safe.
In short: Let the models suggest fixes, then verify each one against tests you trust. AlphaEvolve also supports multi-objective optimization (optimizing latency and accuracy simultaneously), evolving programs that hit several metrics at once. Counter-intuitively, balancing multiple goals can improve a single target metric by encouraging more diverse solutions.
Takeaway for enterprises: Production agents need deterministic scorekeepers. Whether that's unit tests, full simulators, or canary traffic analysis. Automated evaluators are both your safety net and your growth engine. Before you launch an agentic project, ask: 'Do we have a metric the agent can score itself against?'
AlphaEvolve tackles every coding problem with a two-model rhythm. First, Gemini Flash fires off quick drafts, giving the system a broad set of ideas to explore. Then Gemini Pro studies those drafts in more depth and returns a smaller set of stronger candidates. Feeding both models is a lightweight 'prompt builder,' a helper script that assembles the question each model sees. It blends three kinds of context: earlier code attempts saved in a project database, any guardrails or rules the engineering team has written and relevant external material such as research papers or developer notes. With that richer backdrop, Gemini Flash can roam widely while Gemini Pro zeroes in on quality.
Unlike many agent demos that tweak one function at a time, AlphaEvolve edits entire repositories. It describes each change as a standard diff block – the same patch format engineers push to GitHub – so it can touch dozens of files without losing track. Afterward, automated tests decide whether the patch sticks. Over repeated cycles, the agent's memory of success and failure grows, so it proposes better patches and wastes less compute on dead ends.
Takeaway for enterprises: Let cheaper, faster models handle brainstorming, then call on a more capable model to refine the best ideas. Preserve every trial in a searchable history, because that memory speeds up later work and can be reused across teams. Accordingly, vendors are rushing to provide developers with new tooling around things like memory. Products such as OpenMemory MCP, which provides a portable memory store, and the new long- and short-term memory APIs in LlamaIndex are making this kind of persistent context almost as easy to plug in as logging.
OpenAI's Codex-1 software-engineering agent, also released today, underscores the same pattern. It fires off parallel tasks inside a secure sandbox, runs unit tests and returns pull-request drafts—effectively a code-specific echo of AlphaEvolve's broader search-and-evaluate loop.
AlphaEvolve's tangible wins – reclaiming 0.7% of data center capacity, cutting Gemini training kernel runtime 23%, speeding FlashAttention 32%, and simplifying TPU design – share one trait: they target domains with airtight metrics.
For data center scheduling, AlphaEvolve evolved a heuristic that was evaluated using a simulator of Google's data centers based on historical workloads. For kernel optimization, the objective was to minimize actual runtime on TPU accelerators across a dataset of realistic kernel input shapes.
Takeaway for enterprises: When starting your agentic AI journey, look first at workflows where 'better' is a quantifiable number your system can compute – be it latency, cost, error rate or throughput. This focus allows automated search and de-risks deployment because the agent's output (often human-readable code, as in AlphaEvolve's case) can be integrated into existing review and validation pipelines.
This clarity allows the agent to self-improve and demonstrate unambiguous value.
While AlphaEvolve's achievements are inspiring, Google's paper is also clear about its scope and requirements.
The primary limitation is the need for an automated evaluator; problems requiring manual experimentation or 'wet-lab' feedback are currently out of scope for this specific approach. The system can consume significant compute – 'on the order of 100 compute-hours to evaluate any new solution' (AlphaEvolve paper, page 8), necessitating parallelization and careful capacity planning.
Before allocating significant budget to complex agentic systems, technical leaders must ask critical questions:
Machine-gradable problem? Do we have a clear, automatable metric against which the agent can score its own performance?
Do we have a clear, automatable metric against which the agent can score its own performance? Compute capacity? Can we afford the potentially compute-heavy inner loop of generation, evaluation, and refinement, especially during the development and training phase?
Can we afford the potentially compute-heavy inner loop of generation, evaluation, and refinement, especially during the development and training phase? Codebase & memory readiness? Is your codebase structured for iterative, possibly diff-based, modifications? And can you implement the instrumented memory systems vital for an agent to learn from its evolutionary history?
Read More When to ignore — and believe — the AI hype cycle
Takeaway for enterprises: The increasing focus on robust agent identity and access management, as seen with platforms like Frontegg, Auth0 and others, also points to the maturing infrastructure required to deploy agents that interact securely with multiple enterprise systems.
AlphaEvolve's message for enterprise teams is manifold. First, your operating system around agents is now far more important than model intelligence. Google's blueprint shows three pillars that can't be skipped:
Deterministic evaluators that give the agent an unambiguous score every time it makes a change.
Long-running orchestration that can juggle fast 'draft' models like Gemini Flash with slower, more rigorous models – whether that's Google's stack or a framework such as LangChain's LangGraph.
Persistent memory so each iteration builds on the last instead of relearning from scratch.
Enterprises that already have logging, test harnesses and versioned code repositories are closer than they think. The next step is to wire those assets into a self-serve evaluation loop so multiple agent-generated solutions can compete, and only the highest-scoring patch ships.
As Cisco's Anurag Dhingra, VP and GM of Enterprise Connectivity and Collaboration, told VentureBeat in an interview this week: 'It's happening, it is very, very real,' he said of enterprises using AI agents in manufacturing, warehouses, customer contact centers. 'It is not something in the future. It is happening there today.' He warned that as these agents become more pervasive, doing 'human-like work,' the strain on existing systems will be immense: 'The network traffic is going to go through the roof,' Dhingra said. Your network, budget and competitive edge will likely feel that strain before the hype cycle settles. Start proving out a contained, metric-driven use case this quarter – then scale what works.
Watch the video podcast I did with developer Sam Witteveen, where we go deep on production-grade agents, and how AlphaEvolve is showing the way:

Orange background

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Related Articles

Google Makes New $200 Father's Day Offer To Pixel 9 Buyers
Google Makes New $200 Father's Day Offer To Pixel 9 Buyers

Forbes

timean hour ago

  • Forbes

Google Makes New $200 Father's Day Offer To Pixel 9 Buyers

MOUNTAIN VIEW, CALIFORNIA - AUGUST 13: A new Google Pixel 9 phone is displayed during the Made By ... More Google event at Google headquarters on August 13, 2024 in Mountain View, California. Google announced new Pixel phones, watches and AI features during the Made By Google event. (Photo by) Following one of its most aggressive Pixel 9 price drops yet, Google has once again slashed the price of its flagship smartphone. However, for serious deal hunters the company's new Pixel 8 Pro bundle offer might be the better deal. Let's start with the Pixel 9 discount. All models, including the Pixel 9a, are cheaper for this sale, which ends on June 15th. The Pixel 9 price cut is similar to what we have seen in previous sales, but Google does have some new bundle deals, alongside discounts for virtually all wearables. Virtually all of Google's wearables and smart home devices are on sale, too. These are the standout deals. A Google Pixel 8 pro phone is displayed during a product launch event for the Google Pixel 8, and ... More Pixel 8 pro phones, Pixel Watch 2, and Pixel Buds Pro earbuds, in New York on October 4, 2023. (Photo by Ed JONES / AFP) (Photo by ED JONES/AFP via Getty Images) The most eye-catching promotion is for the Pixel 8 Pro, which is now down to $599. Google lists this as a $400 discount, but the 2023 flagship hasn't cost $999 since it launched two years ago. Despite the chicanery, this is a solid deal for a phone that isn't wildly different from the Pixel 9. The promotion is boosted by Google throwing in a free Pixel Watch 2 LTE and a free case with the Pixel 8 Pro. To me, this makes the Pixel 8 Pro the better choice if you're on a budget because the free Pixel Watch can be sold to extend the discount. If you're worried about buying older tech, there's very little daylight between the Pixel 8 and Pixel 9 ranges. The Pixel 8 has many of the Pixel 9's features, including AI tools like Magic Editor, Best Take, and Video Boost (on the Pixel 8 Pro). Formerly exclusive Pixel 9 features, such as the Pixel Weather app, have now migrated over to the Pixel 8. Only the updated hardware separates the two phones. I wrote more extensively about why the Pixel 8 Pro may be the better choice for tight budgets here. Crucially, the Pixel 8 still has five years of software support left. When the device launched in 2023, Google extended the amount of Android updates and security patches its Pixel phones would get to seven years. We will have to see if Google (and Samsung) honor that promise as we edge closer to 2030. But if we take Google at face value, that extended support adds value to the Pixel 8, especially since there are fewer reasons to buy new hardware that only marginally improves per year. Manufacturers may struggle to convince fans to buy new phones when the near-identical predecessor is half the price. This is good news for Pixel 9 and Pixel 8 shoppers because it is currently the best time to buy high-end, cheap, phones.

5 AI tools I rely on more than ChatGPT
5 AI tools I rely on more than ChatGPT

Android Authority

time3 hours ago

  • Android Authority

5 AI tools I rely on more than ChatGPT

Kaitlyn Cimino / Android Authority ChatGPT is usually the first tool that comes to everyone's mind when you start talking about AI. It sure is versatile and capable of doing a lot — often better than others in many ways. Its memory feature helps it learn about you and fine-tune responses, making it more personalized than many other (even paid) alternatives. It's also the one to open the floodgates for a wave of AI tools designed for very specific tasks — the kind that even ChatGPT can't handle well. And that's exactly where the broader world of AI tools comes in. I've tried quite a few of these AI tools, and here are the ones that have truly embedded themselves into my daily workflow. I can't imagine getting through the day without them anymore. What's your go-to AI tool (besides ChatGPT)? 0 votes Gemini NaN % Perplexity NaN % Claude NaN % Something else (comment below!) NaN % Gemini Ryan Haines / Android Authority Gemini is similar to ChatGPT in many ways — and no, I don't use it to unload my life problems. But I do end up using it quite a lot, and the simple reason is its presence. Since I rely on Google products both personally and professionally, Gemini is always just there — sometimes in the sidebar, sometimes just a wake word away. On my Workspace for Business account, I often use it to check grammar, tighten language, or tweak the tone of emails. But I've also had it read contracts in Docs and point out anything working against me or take notes during Meet calls and summarize the meeting within minutes of it ending. It even handles different accents in a single meeting quite well. Perplexity Calvin Wankhede / Android Authority If I've developed muscle memory for using ChatGPT, Perplexity has quietly replaced Google Search for most of my web lookups — especially on desktop. I use it heavily for research. Instead of browsing ten websites, scrubbing YouTube videos, or combing through forums for one specific detail — like what the most popular desktop OS in Europe was in the early 2010s — I just ask Perplexity. And much like appending 'Reddit' to a search to get real user opinions instead of content written for SEO, Perplexity does that for you. It gives you a concise gist, which is handy when I'm looking for general sentiment, say, around viral news. While it saves me the search legwork, I still verify what it gives me just to make sure I'm delivering accurate information with the necessary human oversight. NotebookLM Andy Walker / Android Authority For the longest time, I avoided NotebookLM. It felt like a niche product good for only a few tasks — but that's exactly where its strength lies. And honestly, what a tool Google has made! You can create multiple notebooks and upload source documents, even on the free plan. It analyzes long documents and can surface different perspectives, themes, and patterns — like a real brainstorming partner. That's not what I personally use it for, though. NotebookLM is a true example of AI's potential — I just hope it doesn't land in the Google graveyard anytime soon. What I love is its ability to switch tone easily between 'Guide' and 'Analyst' modes, especially when I need a more conversational or direct approach. But my favorite feature is its podcast-style audio summaries. Those have helped me digest jargon-heavy, never-ending documents while driving or cooking — and suddenly, it doesn't feel like work! NotebookLM is a true example of AI's potential — I just hope it doesn't land in the Google graveyard anytime soon. Notion AI Dhruv Bhutani / Android Authority Some AI tools become part of your routine without you even realizing it — and Notion AI is one of those for me. I initially dismissed it as something Notion was forcing on its users, but I've ended up using it more than I expected. I use Notion for almost all my long-form personal writing — blog posts, short stories, you name it. One thing that's always frustrated me is the lack of autocorrect. When I'm in vomit-draft mode, I don't care about typos or grammar and cleaning them up later was always a pain. With Notion AI, I can fix all that with just a couple of clicks. I can also throw in unformatted lists (groceries, travel packing, etc.) and just ask AI to clean them up. I even use it to brainstorm multiple angles for blog ideas, helping me avoid getting stuck on one track. It's like the second set of eyes I have always wanted for my blogs. The free plan gives me limited prompts, but since I only use Notion once or twice a day, I get by just fine. Ideogram An image generation tool has been eerily missing from this list — that's because I saved the best for the last! Ideogram has been my preferred tool for that because of one big reason. It is one of the only free tools that lets you customize and control a lot of aspects of your generated images, including their size and ratio. Most AI tools generate square images that are terrible for online use as feature images or on social media. Ideogram gets you a few free credits per week and creates some fantastic AI images using its latest-generation model with whatever customization you want. And it also has something called magic prompt that uses AI to create an elaborate prompt on your behalf. We all tend to underexplain AI the exact scene we want, but Ideogram covers you for that. Specialized AI tools are far more useful companions than a chatbot that behaves like an over-eager intern who always needs direction. AI beyond ChatGPT We've had AI around us for years — from Gboard's smart suggestions to Google Assistant — but it wasn't until ChatGPT became a buzzword that we really started noticing generative AI in our everyday lives. It's honestly hard to believe it's just been a couple of years since its arrival. A lot of AI tools have emerged in such a short span, and many of them have surely become an indispensable part of my life. But most importantly, I get to use them to be more productive without fearing about AI dimming my creative spark. They are far more useful companions than a chatbot that behaves like an over-eager intern who always needs direction.

Boosting Shopify Speed & Performance Optimization With the Help of Shop Gait
Boosting Shopify Speed & Performance Optimization With the Help of Shop Gait

Time Business News

time4 hours ago

  • Time Business News

Boosting Shopify Speed & Performance Optimization With the Help of Shop Gait

Every millisecond counts in an eCommerce business, and your Shopify store is no exception. Frustrated users, low conversions, poor search engine rankings, and even worse are the results of slow-loading websites. There is no way to accomplish desired outcomes if optimization is a mere good-to-have feature. It is fundamental to achieve exceptional outcomes. At Shop Gait, we have pieces of advice to ease your journey. Offering Shopify speed & performance optimization guarantees that no further optimization is needed at the user end. It improves ease of use, search engine ranking, and ultimately, sales. This blog will help you with the analysis of the Shopify Support & Maintenance Service and the Shopify API Integration Service to a very advanced level of optimization. Think of a website you would like to visit, but takes the life out of you waiting to load. By all means, a website that loads quickly is user-friendly and highly likely to improve customer satisfaction. In most cases, customers will not leave the website after loading if it has multiple pages. Along with other loading and navigational features, your website's speed impacts its SEO ranking. This means Shopify stores that load slowly are losing potential site visits. This is because Google algorithmically gives preference to sites that load faster relative to others. Therefore, if you have a slow Shopify store, implement Shopify Speed & Performance Optimization, and Shopify Support & Maintenance Service. With that in mind, here are the most common reasons that can help in Shopify Speed & Performance Optimization: – Non-optimized, oversized images – Unoptimized Shopify themes – Excessive use of applications (apps) – Poor coding – Optimization of content delivery is absent Knowing these reasons puts you ahead in trying to improve your store's performance. If you're looking to improve your store's website, evaluating your current statistics is the first step. Google PageSpeed Insights, GTmetrix, and the integrated tools of Shopify can serve as a foundation for Shopify Speed & Performance Optimization. Aside from the services offered, these tools also provide Shopify Support & Maintenance Service, and an insightful explanation of the sources of lagging performance. The metrics that are most analyzed when accessing website performance include: First Contentful Paint (FCP)—the time it takes from the moment a user initiates the page load until the first piece of content appears. Time To Interactive (TTI)—how much time it takes till the webpage becomes completely ready for interaction. Largest Contentful Paint (LCP)— time required till the biggest portion of the webpage is loaded Cumulative Layout Shift (CLS)— Searches for disrupting layout shifts that are not expected and pose a risk to user experience. These metrics help you understand the current state of your store and the most pressing areas for improvement. Image optimization deserves attention because images are often the most time-consuming element on a page. Choose the Most Optimal Formats: Formats such as WebP provide unparalleled quality lossless compression. Formats such as WebP provide unparalleled quality lossless compression. Compress Image Files: Use TinyPNG to decrease the file size without losing clarity. Alternatively, there's also the option of using Shopify Apps. Use TinyPNG to decrease the file size without losing clarity. Alternatively, there's also the option of using Shopify Apps. Use Lazy Loading: Only the images visible on the user's screen will be loaded, and this will decrease both load time and speed. Shopify theme is critical to the performance of your store. Select a Lightweight Theme: Select a responsive and fast theme. Shopify offers the free Dawn theme and other paid options such as Turbo. Select a responsive and fast theme. Shopify offers the free Dawn theme and other paid options such as Turbo. Remove Unused Features: Get rid of all unused animations, 3rd party widgets, and integrations that clutter your theme. Get rid of all unused animations, 3rd party widgets, and integrations that clutter your theme. Clean Up Theme Code: Reduce the CSS and JavaScript code of the theme to remove bloat that is slowing your site. While apps can improve the utility of your store, overusing them can severely cripple performance: Audit Your Apps: Evaluate your app installations regularly and promptly uninstall those that are no longer useful. Evaluate your app installations regularly and promptly uninstall those that are no longer useful. Choose Optimized Apps: Assure that the reviews of an app you want to install are favorable and that it is optimized to have minimal effect on your store's speed. Another aspect of enhancing performance is efficient coding: Minify Resources: Reduce the size and loading time of files by minifying CSS, JavaScript, and HTML files. Reduce the size and loading time of files by minifying CSS, JavaScript, and HTML files. Defer Non-Critical Resources: Load only the most important elements first, deferring scripts that are not immediately needed. Load only the most important elements first, deferring scripts that are not immediately needed. Enable Browser Caching: Files like images, CSS files, and HTML can be kept on the user's computer via caching, making future visits faster. A Content Delivery Network (CDN) refers to a network of servers distributed globally for serving content from the geographical region closest to the user. Shopify already has a built-in CDN that can be optimized by: Integrating with external CDNs such as Cloudflare for redundancy and speed. Exploring the CDN's platform for other options, such as caching for performance boosts. Database Optimization Unused data in your Shopify database can contribute to lag: Regularly remove old or inactive products, images, and customer records. Utilize Shopify apps like Matrixify to clean up and organize your inventory data. With most of the traffic coming from mobile devices, it is essential to optimize mobile access: Responsive Design: Check that your store is mobile-enabled with its texts, buttons, and small diagrams arranged to suit smaller screens. Check that your store is mobile-enabled with its texts, buttons, and small diagrams arranged to suit smaller screens. Accelerated Mobile Pages (AMP): Adding AMP to serve light mobile pages that load without delay. If you use Shopify Plus, you have access to sophisticated methods and tools: Script Editor: Implement the Shopify Plus Script Editor for personalized checkout workflows and enhance performance. Implement the Shopify Plus Script Editor for personalized checkout workflows and enhance performance. Automated Testing: Regularly execute A/B testing to evaluate loading time and optimize it to the best possible level. Regularly execute A/B testing to evaluate loading time and optimize it to the best possible level. API Integration Services: Utilize the Shopify API Integration Services for integration with other platforms, which enhances operational efficiency by automating manual workflows. Ensuring that your Shopify store runs at peak levels of speed and performance is critical for competitiveness, enhances user experience, and contributes positively to SEO for professionals? Shop Gait offers unique, tailored Shopify Support & Maintenance Services to enhance store performance. We also provide an equally capable Shopify API Integration Service, through which our professionals will enable seamless integration to bring your store to pro status. TIME BUSINESS NEWS

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into the world of global news and events? Download our app today from your preferred app store and start exploring.
app-storeplay-store