logo
Is the cloud the wrong place for AI?

Is the cloud the wrong place for AI?

Yahoo2 days ago
When you buy through links on our articles, Future and its syndication partners may earn a commission.
The enterprise software playbook seemed clear: everything moves to the cloud eventually. Applications, databases, storage: they all followed the same inevitable arc from on-premises to software-as-a-service.
But with the arrival and boom of artificial intelligence, we're seeing a different story play out, one where the cloud is just one chapter rather than the entire book.
AI systems
AI workloads are fundamentally different beasts than the enterprise applications that defined the cloud migration wave. Traditional software scales predictably, processes data in batches, and can tolerate some latency.
AI systems are non-deterministic, require massive parallel processing, and often need to respond in real-time. These differences reshape the entire economic equation of where and how you run your infrastructure.
Take the challenge of long-running training jobs. Machine learning models don't train on schedule; they train until they converge. This could be hours, days, or weeks. Cloud providers excel at providing infrastructure at short notice, but GPU capacity at hyperscalers can be hard to get without a 1 year reservation.
The result is either paying for guaranteed capacity you might not fully use, or risking that your training job gets interrupted when using spot instances to reduce costs.
Then there's the inference challenge. Unlike web applications that might see traffic spikes during Black Friday, AI services often need to scale continuously as customer usage grows.
The token-based pricing models that govern large language models make this scaling unpredictable in ways that traditional per-request pricing never was. A single customer query might consume 10 tokens or 10,000, depending on the complexity of the response and the size of the context window.
Hybrid approaches
The most intriguing development involves companies discovering hybrid approaches that acknowledge these unique requirements rather than abandoning the cloud. They're using on-premises infrastructure for baseline, predictable workloads while leveraging cloud resources for genuine bursts of demand.
They're co-locating servers closer to users for latency-sensitive applications like conversational AI. They're finding that owning their core infrastructure gives them the stability to experiment more freely with cloud services for specific use cases.
This evolution is being accelerated by regulatory requirements that simply don't fit the cloud-first model. Financial services, healthcare, and government customers often cannot allow data to leave their premises.
For these sectors, on-premises or on-device inference represents a compliance requirement rather than a preference. Rather than being a limitation, this constraint is driving innovation in edge computing and specialized hardware that makes local AI deployment increasingly viable.
Infrastructure strategies
The cloud providers aren't standing still, of course. They're developing AI-specific services, improving GPU access, and creating new pricing models. But the fundamental mismatch between AI's resource requirements and traditional cloud economics suggests that the future won't be a simple rerun of the SaaS revolution.
Instead, we're heading toward a more nuanced landscape where different types of AI workloads find their natural homes. Experimentation and rapid prototyping will likely remain cloud-native. Production inference for established products might move closer to owned infrastructure. Training runs might split between cloud spot instances for cost efficiency and dedicated hardware for mission-critical model development.
The approach represents a step toward infrastructure strategies that match the actual needs of AI systems rather than forcing them into patterns designed for different types of computing
The most successful AI companies of the next decade will likely be those that think beyond the cloud-first assumptions and build infrastructure strategies as sophisticated as their algorithms.
We've featured the best cloud storage.
This article was produced as part of TechRadarPro's Expert Insights channel where we feature the best and brightest minds in the technology industry today. The views expressed here are those of the author and are not necessarily those of TechRadarPro or Future plc. If you are interested in contributing find out more here: https://www.techradar.com/news/submit-your-story-to-techradar-pro
Orange background

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Related Articles

Healthy Position In AI-driven Memory Market Aids Micron Technology (MU), Says Parnassus Investments
Healthy Position In AI-driven Memory Market Aids Micron Technology (MU), Says Parnassus Investments

Yahoo

time24 minutes ago

  • Yahoo

Healthy Position In AI-driven Memory Market Aids Micron Technology (MU), Says Parnassus Investments

Micron Technology, Inc. (NASDAQ:MU) is one of the Most Undervalued Semiconductor Stocks to Buy According to Analysts. In its Q2 2025 investor letter, Parnassus Investments stated that the company's stock is being helped by a healthy position in the broader AI-driven memory market. Furthermore, the management highlighted the strong demand in its latest quarter. In Q3 2025, Micron Technology, Inc. (NASDAQ:MU)'s data center revenue more than doubled YoY, reaching a quarterly record, while consumer-oriented end markets witnessed robust sequential growth. A close-up view of a computer motherboard with integrated semiconductor chips. Coming to Micron Technology, Inc. (NASDAQ:MU)'s end markets, in the data center, the company projects the CY 2025 server market to rise by mid-single digits percentage in units, mainly due to the significant growth in AI servers. Micron Technology, Inc. (NASDAQ:MU) anticipates PC market units to rise in the low single-digit percentage range in CY 2025. In the upcoming quarters, critical catalysts for growth consist of elevated adoption of AI-enabled PCs and the Windows 11 upgrade cycle. Micron Technology, Inc. (NASDAQ:MU) remains focused on bringing differentiated high-performance products to the PC market. While we acknowledge the potential of MU as an investment, we believe certain AI stocks offer greater upside potential and carry less downside risk. If you're looking for an extremely undervalued AI stock that also stands to benefit significantly from Trump-era tariffs and the onshoring trend, see our free report on the best short-term AI stock. READ NEXT: 13 Cheap AI Stocks to Buy According to Analysts and 11 Unstoppable Growth Stocks to Invest in Now Disclosure: None. This article is originally published at Insider Monkey. Error in retrieving data Sign in to access your portfolio Error in retrieving data Error in retrieving data Error in retrieving data Error in retrieving data

D.C. Unleashes ChatGPT and Gemini -- AI Now Federal-Grade
D.C. Unleashes ChatGPT and Gemini -- AI Now Federal-Grade

Yahoo

time24 minutes ago

  • Yahoo

D.C. Unleashes ChatGPT and Gemini -- AI Now Federal-Grade

The U.S. government just gave AI its biggest public-sector runway yet. On Tuesday, the General Services Administration quietly approved OpenAI, Google (NASDAQ:GOOG), and Anthropic as official AI vendors through its Multiple Award Scheduleessentially a federal fast pass for enterprise software. That means agencies can now bypass red tape and deploy tools like ChatGPT, Gemini, and Claude with pre-negotiated terms. For tech investors, this isn't just a signalit's a megaphone. These models, once limited to pilot programs and national security experiments, could now be embedded across departments like Treasury, Personnel Management, and more. Warning! GuruFocus has detected 6 Warning Signs with PFE. The timing is no coincidence. Just days ago, President Donald Trump signed a set of executive orders aimed at reshaping how AI is procured, including a clause requiring federal agencies to avoid ideological bias in language models. GSA officials were quick to clarify that the approved vendors weren't picked as winnersjust first to clear the contracting hurdles. Still, the implications are hard to ignore. This opens up a multi-billion-dollar channel for enterprise AI, and the early players could gain embedded access across dozens of federal workflows. Historically, GSA has used its scale to hammer down prices from software giants like Adobe and Salesforce. Similar pricing pressure could now apply to LLM vendorsbenefiting adoption, if not margins. Multiple agencies are already outlining use casesfrom fraud detection and grant reviews to policy comment summarization and internal chatbot assistants. The Office of Personnel Management wants to use AI to digest tens of thousands of public responses on regulatory changesa task that once took months. But as OPM Director Scott Kupor put it: We're probably missing people who are super conversant with very modern, AI-related stuff. Translation: software alone won't solve it. Hiring and execution will matter. Still, this latest GSA approval could be a meaningful step toward widespread institutional adoptionand a new chapter in the AI enterprise race. This article first appeared on GuruFocus. Error in retrieving data Sign in to access your portfolio Error in retrieving data Error in retrieving data Error in retrieving data Error in retrieving data

Palantir stops the show: Opening Bid top takeaway
Palantir stops the show: Opening Bid top takeaway

Yahoo

time24 minutes ago

  • Yahoo

Palantir stops the show: Opening Bid top takeaway

The AI trade continues to steal the show. Palantir (PLTR) has the hottest ticker page on Yahoo Finance today after the company's stellar earnings last night. The defense tech play broke $1 billion in sales in a quarter for the first time. Co-founder and CEO Alex Karp — wearing a white T-shirt on the earnings call — signaled the AI revolution is just beginning and haters should consider giving up their bear cases on the stock. "So thank you for all of our supporters, and we should entertain questions, but this is a once-in-a-generation truly anomalous quarter, and we're very proud, and we're sorry that our haters are disappointed, but there are many more quarters to be disappointed, and we're working on that too," Karp opined. Pouncing on the AI optimism, Bank of America published a bullish note on Nvidia (NVDA) weeks ahead of its Aug. 27 earnings report. Analyst Vivek Arya said Nvidia's quarterly sales will easily beat estimates and third quarter guidance will be upbeat due to Blackwell chip demand. If there is a hiccup in the quarter and outlook, Arya warns it could come from China — where Nvidia's H20 chip may be subject to potential security probes by Chinese regulators. Arya's $220 price target assumes about 22% upside in Nvidia's stock from current levels. "It's really gravity-defying how much capex is going into AI," Lou Basenese, chief strategist at the Basenese Group, told me on Yahoo Finance's Opening Bid. All this as investors buy the slight dip in the markets ahead of a potential Fed rate cut in September. But without question, Palantir's results left a mark on the minds of investors. Zoom-in: Palantir hits it out of the park Palantir cleaned up on government business and large companies in the quarter. Results beat estimates and accelerated from the first quarter. Karp took to his soapbox on the earnings call to slam critics and hype up his business. Here are the earnings day wins: Sales up 48% from the prior year. US commercial operating margins advanced 890 basis points year over year. The top line accelerated sequentially by major business segment. Bookings growth accelerated sequentially. Full-year sales guidance was raised by 6 percentage points to 45% year over year vs. previous expectation of 36% year over year. The implied fourth quarter revenue guidance was 10 percentage points ahead of consensus, implying revenue growth in the low-40s versus consensus expectations of 32%. "Palantir delivered a show-stopping Q2," Citi analyst Tyler Radke said. Shares rose 8% on the session. The question now is how far Palantir's stock could run into the third quarter earnings release in a few months. It will be hard to poke a hole in the bullish narrative, even in the face of an outsized valuation. Palantir had a shockingly good quarter, and guidance was impressive. The AI revolution only seems to be speeding up, judging by earnings results from Microsoft (MSFT), Amazon (AMZN), and Alphabet (GOOG, GOOGL) in recent weeks. "We believe in the next few years Palantir has the potential to be a trillion dollar market cap as the AI Revolution takes hold," said Wedbush analyst Dan Ives. Palantir's current market cap: $410 Sozzi is Yahoo Finance's Executive Editor and a member of Yahoo Finance's editorial leadership team. Follow Sozzi on X @BrianSozzi, Instagram, and LinkedIn. Tips on stories? Email Error in retrieving data Sign in to access your portfolio Error in retrieving data Error in retrieving data Error in retrieving data Error in retrieving data

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into a world of global content with local flavor? Download Daily8 app today from your preferred app store and start exploring.
app-storeplay-store