
Six Ways To Advance Modern Architecture For AI Systems
These days, many engineering teams are coming up against a common problem – basically speaking, the models are too big. This problem comes in various forms, but there's often a connecting thread and a commonality to the challenges.
Project are running up against memory constraints. As parameters range into the billions and trillions, data centers have to keep up. Stakeholders have to look out for thresholds in vendor services. Cost is generally an issue.
However, there are new technologies on the horizon that can take that memory footprint and compute burden, and reduce them to something more manageable.
How are today's innovators doing this?
Let's take a look.
Input and Data Compression
First of all, there is the compression of inputs.
You can design a loss algorithm to compress the model, and even run a compressed model versus the full one; compression methodologies are saving space when it comes to specialized neural network function.
Here's a snippet from a paper posted at Apple's Machine Learning Research resource:
'Recently, several works have shown significant success in training-free and data-free compression (pruning and quantization) of LLMs achieving 50-60% sparsity and reducing the bit-width down to 3 or 4 bits per weight, with negligible perplexity degradation over the uncompressed baseline.'
That's one example of how this can work.
This Microsoft document looks at prompt compression, another component of looking at how to shrink or reduce data in systems.
The Sparsity Approach: Focus and Variation
Sometimes you can carve away part of the system design, in order to save resources.
Think about a model where all of the attention areas work the same way. But maybe some of the input area is basically white space, where the rest of it is complex and relevant. Should the model's coverage be homogenous or one-size-fits-all? You're spending the same amount of compute on high and low attention areas.
Alternately, people engineering the systems can remove the tokens that don't get a lot of attention, based on what's important and what's not.
Now in this part of the effort, you're seeing hardware advances as well. More specialized GPU and multicore processors can have an advantage when it comes to this kind of differentiation, so take a look at everything that makers are doing to usher in a whole new class of GPU gear.
Changing Context Strings
Another major problem with network size is related to the context windows that systems use.
If they are typical large language systems operating on a sequence, the length of that sequence is important. Context means more of certain kinds of functionality, but it also requires more resources.
By changing the context, you change the 'appetite' of the system. Here's a bit from the above resource on prompt compression:
'While longer prompts hold considerable potential, they also introduce a host of issues, such as the need to exceed the chat window's maximum limit, a reduced capacity for retaining contextual information, and an increase in API costs, both in monetary terms and computational resources.'
Directly after that, the authors go into solutions that might have broad application, in theory, to different kinds of fixes.
Dynamic Models and Strong Inference
Here are two more big trends right now: one is the emergence of strong inference systems, where the machine teaches itself what to do over time based on its past experience. Another is dynamic systems, where the input weights and everything else changes over time, rather than remaining the same.
Both of these have some amount of promise, as well, for helping to match the design and engineering needs that people have when they're building the systems.
There's also the diffusion model where you add noise, analyze, and remove that noise to come up with a new generative result. We talked about this last week in a post about the best ways to pursue AI.
Last, but not least, we can evaluate traditional systems such as digital twinning. Twinning is great for precise simulations, but it takes a lot of resources – if there's a better way to do something, you might be able to save a lot of compute that way.
These are just some of the solutions that we've been hearing about and they dovetail with the idea of edge computing, where you're doing more on an endpoint device at the edge of a network. Microcontrollers and small components can be a new way to crunch data without sending it through the cloud to some centralized location.
Think about all of these advances as we sit through more of what people are doing these days with AI.
Hashtags

Try Our AI Features
Explore what Daily8 AI can do for you:
Comments
No comments yet...
Related Articles


Tom's Guide
a minute ago
- Tom's Guide
Apple's smartphone market share plummets as Samsung surges — here's why
The rising demand for the best foldable phones is great for Samsung and not so great for Apple. Now, a new report shows Samsung is eating into Apple's market lead in the U.S. after the iPhone maker saw a rare double-digit dip last quarter. In the second quarter of 2025, Apple's market share in the U.S. fell from 56% to 49%, while Samsung's share surged from 23% to 31%, according to data from Canalys. That means Samsung managed to close the market share gap between it and Apple from 33% a year ago to 18% last quarter. Much of Samsung's second-quarter improvement hinged on its more affordable Galaxy A series, like the Galaxy A36. That just goes to show that the race for the best cheap phones is heating up alongside rising prices. Samsung's premium offerings, particularly the Galaxy Z Fold 7 and Z Flip 7 that debuted last month, have also gained plenty of traction on social media for their surprising durability and value. Canalys credits Samsung's record Q2 growth to its strategy of "smart volume," a focus on offering a wider range of products at different price levels compared to Apple's line-up. Samsung's Galaxy and Z phone lineups start at $650 (for the Galaxy S24 FE) and go up to $2,400 (for the 1TB storage option Galaxy Z Fold 7). "That is a massive span of devices,' Canalys analyst Runar Bjorhovde told NBC News. 'There is an idea that you can target people at every single price point, and you can meet them at every spot.' Though the overall U.S. smartphone market barely grew during the same period, up to 27.1 million units compared to last year's 26.7 million, Samsung enjoyed the strongest performance of any phonemaker. The company shipped 8.3 million units in Q2 2025, a 38% increase year over year. Apple, on the other hand, saw shipments fall by 11% to 13.3 million units, down from 14.9 million a year ago. Apple still holds the crown for most smartphone sales in the U.S., but for the first time in over a decade, its position is beginning to look shaky on its home turf. The last time we saw Samsung seriously challenge Apple's top spot was back in 2014, when the Korean smartphone manufacturer embraced big-screen phones and phablets while Apple was still dragging its feet. Consumer preference isn't the only thing fueling the market share shift, though. As Canalys notes, "Samsung's performance in Q2 was boosted by frontloading of inventory into the U.S. amid tariff concerns," one of several strategies smartphone makers used to minimize the impact of Trump's tariffs. Get instant access to breaking news, the hottest reviews, great deals and helpful tips. If Apple wants to regain its standing in the U.S. market, it'll have to pull out all the stops at its iPhone 17 event this September. This year promises a big shake-up for the iPhone lineup, with the Plus model expected to be replaced by the super-thin iPhone 17 Air. The iPhone 17 Air is a direct competitor to the Galaxy S25 Edge, Samsung's thinnest phone yet, but it may not be enough to change Apple's fortunes. Counterpoint Research, a research firm that estimates smartphone sales, found Samsung only saw a "slight year-over-year boost" in sales after the S25 Edge was released in May. If consumers are as blasé about the Air as they were about the Edge, Apple's going to be in trouble. Follow Tom's Guide on Google News to get our up-to-date news, how-tos, and reviews in your feeds. Make sure to click the Follow button.
Yahoo
30 minutes ago
- Yahoo
Tempus AI (TEM) Raises Full-Year Revenue Guidance
Tempus AI has posted an impressive price move of 21% over the last week, likely influenced by a blend of recent developments. The company announced significant revenue growth in its second quarter results and notably reduced its net loss. This positive financial performance was reinforced by the raised full-year revenue guidance, although it accompanies challenges such as a class action lawsuit and a substantial equity offering. The broader market has also been on an upward trajectory, with the Dow reaching records, which may have contributed to buoying Tempus AI's stock amidst mixed news on legal and financial fronts. We've discovered 3 warning signs for Tempus AI that you should be aware of before investing here. We've found 19 US stocks that are forecast to pay a dividend yield of over 6% next year. See the full list for free. Tempus AI's recent developments, including significant revenue growth and reduced net loss for the second quarter, have influenced its impressive 21% share price increase over the last week. Over the longer period of the past year, the company's total return was 45.01%, showcasing substantial growth compared to both the broader market and its industry peers. Tempus AI outpaced the US Life Sciences industry, which had a return of -19.8%, and also exceeded the US market's 17% return. These positive financial results reinforce the company's revenue and earnings forecasts, supported by strong testing volumes and strategic biopharma partnerships. Analysts have projected Tempus AI's revenue to grow by 29.8% annually over the next three years, even though profitability remains elusive in the short term. The raised full-year revenue guidance could further bolster future earnings, provided reimbursement and regulatory challenges are effectively managed. Despite the current share price of $73.78, slightly above the consensus analyst price target of $70.0, the company's rapid growth trajectory potentially justifies this premium. Analysts' expectations reflect a degree of agreement regarding Tempus AI's valuation, suggesting that the stock may be fairly priced. However, sustained momentum in revenue, coupled with disciplined cost management, will be crucial for aligning with long-term growth objectives and closing any gaps between market performance and valuation targets. Our comprehensive valuation report raises the possibility that Tempus AI is priced higher than what may be justified by its financials. This article by Simply Wall St is general in nature. We provide commentary based on historical data and analyst forecasts only using an unbiased methodology and our articles are not intended to be financial advice. It does not constitute a recommendation to buy or sell any stock, and does not take account of your objectives, or your financial situation. We aim to bring you long-term focused analysis driven by fundamental data. Note that our analysis may not factor in the latest price-sensitive company announcements or qualitative material. Simply Wall St has no position in any stocks mentioned. Companies discussed in this article include TEM. This article was originally published by Simply Wall St. Have feedback on this article? Concerned about the content? with us directly. Alternatively, email editorial-team@ Error in retrieving data Sign in to access your portfolio Error in retrieving data Error in retrieving data Error in retrieving data Error in retrieving data
Yahoo
30 minutes ago
- Yahoo
Peter Lynch: 'Stock Market Has Been The Best Place To Be, But If You Need Money In 1 or 2 Years, You Shouldn't Be Buying Stocks'
Renowned investor Peter Lynch has underscored the importance of long-term investment strategies, advising against the pursuit of quick returns. What Happened: Lynch offered his insights to those looking forward to retirement. He cautioned that the stock market is not a short-term playground. 'The stock market's been the best place to be over the last 10 years, 30 years, 100 years. But if you need money in 1 or 2 years, you shouldn't be buying stocks,' Lynch advised. He further explained that substantial returns that can significantly alter one's lifestyle demand more than just a couple of years of investment. Hence, those planning to retire within the next five to ten years should contemplate investing in the market presently. Lynch also revealed his approach of identifying excellent companies in struggling sectors. 'I'm always on the lookout for great companies in lousy industries. Also Read: Investment Guru Peter Lynch: 'Often Great Investments Are The Ones Where Everyone Else Will Think You Are Crazy' A great industry that's growing too fast, such as computers or medical technology, attracts too much attention and too many competitors,' he said. He stressed that the best investments are not always the big players like Apple Inc. (NASDAQ:AAPL), Microsoft Corporation (NASDAQ:MSFT), or Google LLC (NASDAQ:GOOGL). Rather, companies that are flourishing in industries facing difficulties can yield better overall returns. Lynch's advice comes at a time when many are seeking guidance on retirement planning. His emphasis on long-term investment strategies over quick returns aligns with the principle of patience in investing. His strategy of identifying thriving companies in struggling industries provides a fresh perspective, challenging the conventional wisdom of investing in big names. This could potentially lead to better returns and a more secure retirement for many. Read Next Investment Guru Peter Lynch: 'If You Can't Explain To An 11-Year-Old In 2 Minutes Or Less Why You Own The Stock, You Shouldn't Own It' Up Next: Transform your trading with Benzinga Edge's one-of-a-kind market trade ideas and tools. Click now to access unique insights that can set you ahead in today's competitive market. Get the latest stock analysis from Benzinga? APPLE (AAPL): Free Stock Analysis Report TESLA (TSLA): Free Stock Analysis Report This article Peter Lynch: 'Stock Market Has Been The Best Place To Be, But If You Need Money In 1 or 2 Years, You Shouldn't Be Buying Stocks' originally appeared on © 2025 Benzinga does not provide investment advice. All rights reserved. Error in retrieving data Sign in to access your portfolio Error in retrieving data Error in retrieving data Error in retrieving data Error in retrieving data