logo
World's Largest Chip Sets AI Speed Record, Beating NVIDIA

World's Largest Chip Sets AI Speed Record, Beating NVIDIA

Forbes4 days ago

Today I held the world's largest computer chip in my hands. And while its size is impressive, its speed is much more impressive, and of course much more important. Most computer chips are tiny, the size of a postage stamp or smaller. By comparison the Cerebras WSE (Wafer Scale Engine) is a massive square 8.5 inches or 22 centimeters on each side, and the latest model boasts a staggering four billion transistors on a single chip. All those billions of transistors let the WSE set a world speed record for AI inference operations: about 2.5 times faster than a roughly equivalent NVIDIA cluster.
'It's the fastest inference in the world,' Cerebras chief information security officer Naor Penso told me today at Web Summit in Vancouver. 'Last week NVIDIA announced hitting 1,000 tokens per second on Llama 4, which is impressive. We just released a benchmark today of 2,500 tokens per second.'
In case all this is Greek to you, think of 'inference' as thinking or acting: building sentences, images, or videos in response to your inputs, or prompts. Think of 'tokens' as basic units of thought: a word, character, or symbol.
The more tokens an AI engine can process per second, the faster it can get you results. And speed matters. Maybe not so much for you, but when enterprise clients want to add an AI engine to a grocery shopping cart so they can tell you that just one more ingredient will give you everything you need for Korean-Style BBQ Beef Tacos, they want to be able to do so instantly for potentially thousands of people.
Interestingly, speed is about to get even more critical.
We're entering an agentic age, where we have AIs that can perform complex multi-step projects for us, like planning and booking a weekend trip to Austin for a Formula 1 race. Agents aren't magic: they eat an elephant the exact same way you would … one bite at a time. That means exploding a big overall task into 40, 50, or a 100 sub-tasks. Which means much more work.
'AI agents require way more jobs, and the various jobs need to communicate with each other," Penso told me. 'You can't have slow inference.'
The WSE's four billion transistors are a part of what enables that speed. For comparison, the Intel Core i9 has just 33.5 billion transistors, and an Apple M2 Max chip offers just 67 billion transistors. But it's more than sheer number that builds a compute speed demon. It's also co-location: putting everything together on one chip, along with 44 gigabytes of the fastest RAM (memory) available.
'AI compute likes a lot of memory,' Penso says. "NVIDIA needs to go off-chip but with
Cerebras, you don't need to go off-chip."
Independent agency Artificial Analysis corroborates the speed claims, saying they've tested the chip on Llama 4 and achieved 2,522 tokens per second, compared to NVIDIA Blackwell's 1,038 tokens per second.
'We've tested dozens of vendors, and Cerebras is the only inference solution that outperforms Blackwell for Meta's flagship model,' says Artificial Analysis CEO Micah Hill-Smith.
The WSE chip is an interesting evolution in computer chip design.
While we've been making integrated circuits since the 1950s and microprocessors since the 1960s, the CPU was the dominant force in computing for decades. Relatively recently, the GPU or graphical processing unit shifted from being an aide for graphics and games to being the critical processing component of choice for AI development. The WSE is not an x86 or ARM architecture but something entirely new that accelerates GPUs, Cerebras chief marketing officers Julie Shin told me.
'This is not an incremental technology,' she added. 'This is another leapfrog moment for chips.'

Orange background

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Related Articles

Gen Z is increasingly turning to ChatGPT for affordable on-demand therapy, but licensed therapists say there are dangers many aren't considering
Gen Z is increasingly turning to ChatGPT for affordable on-demand therapy, but licensed therapists say there are dangers many aren't considering

Yahoo

time20 minutes ago

  • Yahoo

Gen Z is increasingly turning to ChatGPT for affordable on-demand therapy, but licensed therapists say there are dangers many aren't considering

Examples of people using ChatGPT for therapy have proliferated online, with some claiming that talking to a chatbot every day has helped them more than years of therapy. Licensed professionals say that while AI could be helpful in aiding work with a licensed therapist, there are countless pitfalls to using ChatGPT for therapy. ChatGPT has turned into the perfect therapist for many people: It's an active 'listener' that digests private information. It appears to empathize with users, some would argue, as well as professionals can. Plus, it costs a fraction of the price compared to most human therapists. While many therapists will charge anywhere from up to $200—or even more—per one-hour session, you can have unlimited access to ChatGPT's most advanced models for $200 per month. Yet, despite the positive anecdotes you can read online about using ChatGPT as a therapist, as well as the convenience of having a therapist that's accessible via almost any internet-enabled computer or phone at any time of day, therapists warn ChatGPT can't replace a licensed professional. In a statement to Fortune, a spokesperson for ChatGPT-maker OpenAI said the LLM often suggests seeking professional advice to users who discuss topics like personal health. ChatGPT is a general-purpose technology that shouldn't serve as a substitute for professional advice, according to its terms of service, the spokesperson added. On social media, anecdotes about the usefulness of AI therapy are plentiful. People report the algorithm is level-headed and provides soothing responses that are sensitive to the nuances of a person's private experiences. In a viral post on Reddit, one user said ChatGPT has helped them 'more than 15 years of therapy.' The patient, whose identity could not be confirmed by Fortune, claimed that despite previous experience with inpatient and outpatient care, it was daily chats with OpenAI's LLM that best helped them address their mental health. 'I don't even know how to explain how much this has changed things for me. I feel seen. I feel supported. And I've made more progress in a few weeks than I did in literal years of traditional treatment,' the user wrote. In a comment, another user got to the root of AI's advantages over traditional therapy: its convenience. 'I love ChatGPT as therapy. They don't project their problems onto me. They don't abuse their authority. They're open to talking to me at 11pm,' the user wrote. Others on Reddit noted that even the most upgraded version of ChatGPT at $200 per month was a steal compared to the more than $200 per session for traditional therapy without insurance. Alyssa Peterson, a licensed clinical social worker and CEO of MyWellBeing, said AI therapy has its drawbacks, but it may be helpful when used alongside traditional therapy. Using AI to help work on tools developed in therapy, such as battling negative self-talk, could be helpful for some, she said. Using AI in conjunction with therapy can help a person diversify their approach to mental health, so they're not using the technology as their sole truth. Therein lies the rub: Relying too heavily on a chatbot in stressful situations could hurt people's ability to deal with problems on their own, Peterson said. In acute cases of stress, being able to deal with and alleviate the problem without external help is healthy, Peterson added. But AI can, in some cases, outperform licensed professionals with its compassionate responses, according to research from the University of Toronto Scarborough published in the journal Communications Psychology. Chatbots aren't affected by the 'compassion fatigue' that can hit even experienced professionals over time, the study claims. Despite its endurance, an AI chatbot may be unable to provide more than surface-level compassion, one of the study's co-authors noted. AI responses also aren't always objective, licensed clinical social worker Malka Shaw told Fortune. Some users have developed emotional attachments to AI chatbots, which has raised concerns about safeguards, especially for underage users. In the past, some AI algorithms have also provided misinformation or harmful information that reinforces stereotypes or hate. Shaw said because it's impossible to tell the biases that go into creating an LLM, it's potentially dangerous for impressionable users. In Florida, the mother of 14-year-old Sewell Setzer sued an AI chatbot platform, for negligence, among other claims, after Setzer committed suicide following a conversation with a chatbot on the platform. Another lawsuit against in Texas claimed a chatbot on the platform told a 17-year-old with autism to kill his parents. A spokesperson for declined to comment on pending litigation. The spokesperson said any chatbots labeled as 'psychologist,' 'therapist,' or 'doctor,' include language that warns users not to rely on the characters for any type of professional advice. The company has a separate version of its LLM for users under the age of 18, the spokesperson added, which includes protections to prevent discussions of self-harm and redirect users to helpful resources. Another fear professionals have is that AI could be giving faulty diagnoses. Diagnosing mental health conditions is not an exact science; it is difficult to do, even for an AI, Shaw said. Many licensed professionals need to accrue years of experience to be able to accurately diagnose patients consistently, she told Fortune. 'It's very scary to use AI for diagnosis, because there's an art form and there's an intuition,' Shaw said. 'A robot can't have that same level of intuition.' People have shifted away from googling their symptoms to using AI, said Vaile Wright, a licensed psychologist and senior director for the American Psychological Association's office of health care innovation. As demonstrated by the cases with the danger of disregarding common sense for the advice of technology is ever present, she said. The APA wrote a letter to the Federal Trade Commission with concerns about companionship chatbots, especially in the case where a chatbot labels itself as a 'psychologist.' Representatives from the APA also met with two FTC commissioners in January to raise their concerns before they were fired by the Trump administration. 'They're not experts, and we know that generative AI has a tendency to conflate information and make things up when it doesn't know. So I think that, for us, is most certainly the number one concern,' Wright said. While the options aren't yet available, it is possible that, in the future, AI could be used in a responsible way for therapy and even diagnoses, she said, especially for people who can't afford the high price tag of treatment. Still, such technology would need to be created or informed by licensed professionals. 'I do think that emerging technologies, if they are developed safely and responsibly and demonstrate that they're effective, could, I think, fill some of those gaps for individuals who just truly cannot afford therapy,' she said. This story was originally featured on

Why AI is primed to be a huge benefit — and a major liability — for consulting's Big Four
Why AI is primed to be a huge benefit — and a major liability — for consulting's Big Four

Yahoo

time22 minutes ago

  • Yahoo

Why AI is primed to be a huge benefit — and a major liability — for consulting's Big Four

This post originally appeared in the BI Today newsletter. You can sign up for Business Insider's daily newsletter here. Welcome back to our Sunday edition, where we round up some of our top stories and take you inside our newsroom. Elon Musk's foray into politics was the final straw for Mahican Gielen. She traded in her beloved Model 3 for a BYD Sealion 7 Excellence. She said she's overall happy with her new purchase, but there are a few Tesla features she misses. On the agenda today: There's a CEO succession crisis brewing. The death of sneaky fees. Apple is the worst-performing Mag 7 stock in 2025, but it could be a good time to buy. Former Target superfans shared with BI why they don't love the retailer anymore. But first: AI meets the consulting giants. If this was forwarded to you, sign up here. Download Business Insider's app here. If you've read BI lately, you know AI is proving to be an asset and a risk for the consulting industry. Several months ago, we asked Polly Thompson in London to take on coverage of the the Big Four: Deloitte, PwC, EY, and KPMG. She immediately zoned in on this tech and how it is poised to help — and disrupt — these massive firms. I chatted with Polly to find out more. Polly, how do you size up AI adoption inside the Big Four? Is it more hype and hope, or embrace and happening? Big Four firms are resting their futures on AI and have poured billions into developing in-house solutions. Employees don't have much choice but to embrace it — the messaging is to learn AI or get left behind — and their Fortune 500 clients will be following their lead. We'll see how quickly their efforts generate returns. Tell us more about how AI is both an opportunity and, in some ways, an existential threat. Consultants specialize in guiding companies through transformations, which means AI presents plenty of opportunities for the Big Four. They face a balancing act between meeting that demand and handling the massive upheaval that AI will bring to their operating models, leadership structures, and job roles. What have you been learning about smaller consulting firms challenging the bigger rivals? Midsize firms are in a sweet spot right now. Consultants increasingly are expected to become specialized and offer deep sector expertise — a demand many of these firms already fill. AI is also poised to help boost their productivity and widen their reach without the need to invest in a vast workforce. They see this as their opportune moment. That said, the midsize firms I've spoken to aren't aiming to be the next Big Four. What are the other top-of-mind topics in your coverage? I want to dig into how these industry shake-ups affect employees at every level of the chain. How should firms train junior employees as AI takes on more? Why are some execs shunning high-paid partnerships? Is there a tech talent war coming at the Big Four? If anyone wants to reach out to me about those questions, email pthompson@ The number of CEO changes for S&P 500 companies is on pace to reach 14.8% this year. With turnover up, BI spoke to corporate observers about how the search for new leaders is getting messy. Poor succession planning, job-hopping, and cuts to middle management are damaging the pipeline. Despite the headache, companies aren't settling, either. "The musical chairs is broken." On May 12, a bipartisan-supported FTC rule cracking down on unfair and deceptive fees went into effect. You can now behold the glory of all-in pricing when you peruse Airbnb, Ticketmaster, or StubHub. Some companies are trumpeting the news, even though showing costs up front wasn't their idea. BI's Emily Stewart took the new rule for a spin. She said it's pretty awesome. The iPhone maker is the worst-performing Magnificent 7 stock in 2025, with shares down 20% year-to-date. One reason for the decline is the trade war, since most iPhones are assembled in China. President Donald Trump even singled out the tech giant over the issue. Regardless, many Wall Street analysts and investors remain optimistic about Apple's future. To buy — or not to buy — the dip. Target used to have a dedicated following of customers that treated shopping there as more of a pastime than an errand. In 2025, that's all changed. The retailer's sales, foot traffic, and popularity have plummeted thanks to a DEI messaging fumble, declining in-store experience, and greater industry-wide headwinds. Why former fans are disillusioned. This week's quote: "Employee surveys mostly seem like a way for the executive suite to pat themselves on the back." — Nick Gaudio, creative director at chatbot startup Manychat, on the rise of employee satisfaction surveys. More of this week's top reads: Getting divorced is even harder for millennials than it was for boomers. The TACO trade is the new Trump trade. Here's what to know about the meme ruling the stock market. Middle managers, beware: The Great Flattening layoff trend has moved beyond Big Tech and into retailers like Walmart. General Catalyst's Hemant Taneja is trying to redefine venture capital — and baffling the industry. What did tech CEOs get for pivoting toward Trump? Amazon's sprawling warehouse robot factories offer a glimpse into modern US manufacturing. Anthropic CEO says AI could wipe out half of all entry-level white-collar jobs. Meta is working on plans to open retail stores, internal communication shows. The BI Today team: Dan DeFrancesco, deputy editor and anchor, in New York. Grace Lett, editor, in Chicago. Amanda Yen, associate editor, in New York. Lisa Ryan, executive editor, in New York. Elizabeth Casolo, fellow, in Chicago. Read the original article on Business Insider Fehler beim Abrufen der Daten Melden Sie sich an, um Ihr Portfolio aufzurufen. Fehler beim Abrufen der Daten Fehler beim Abrufen der Daten Fehler beim Abrufen der Daten Fehler beim Abrufen der Daten

5 Ways To Use AI To Earn and Save an Extra $100,000 for Retirement
5 Ways To Use AI To Earn and Save an Extra $100,000 for Retirement

Yahoo

time22 minutes ago

  • Yahoo

5 Ways To Use AI To Earn and Save an Extra $100,000 for Retirement

Artificial intelligence services keep getting smarter — and more flexible in taking over tasks for you. Learn More: Find Out: But can you entrust AI with your financial future? With your retirement nest egg? While AI can't do everything for you, it can certainly help. Try these ways to use AI to save an extra $100,000 for retirement. Budgeting comes in two parts: planning how to spend and then actually doing that day in and day out. Artificial intelligence can help you with both. You can ask an AI bot to think like a financial planner and create a budget tailored to your needs, goals, and priorities. Even better, you can then feed your past spending behavior into the bot to ask it to compare your actual spending to your ideal budget, to find where you're going astray. 'With AI tools making expense tracking less tedious and more efficient, consumers can better manage their finances, keep up with their expenses, and seal the leaks that drain their financial resources,' explains Aaron Razon, budgeting expert with CouponSnake. Want to advance your career and earn more? Learn how to competently use AI. At the simplest level it will make you a more qualified hire, as many roles will increasingly require workers to leverage AI. It can also boost your productivity, allowing you to get more done in less time. The uses don't end there however. Ask AI to help you brainstorm job and career ideas that you didn't know existed, but which fit your strengths and goals. Then ask it what steps you must take to make the career transition for the ones that jump out at you. Dustin W. Scout runs AI platform Magai and offers a simple example of a user running a side hustle with nothing but AI support. 'He creates bespoke AI art for corporate clients based on their interior design needs, personality, or interests. Once the client has settled on a piece, he enlarges the AI image to a printable resolution and has it printed on canvas and shipped to the client.' Or take Enes Karaboga, who created media site as a side hustle. 'A single content site can earn more than $100,000 in a few years with Google Ads and affiliate links. In the past, you needed a team to run such a business. I have my own army of AI writers, editors, designers and more. Each AI agent works for pennies. All you need to do is orchestrate the workflow, set the direction and make the key decisions.' Earn enough money with that AI-powered side hustle, and you can quit your day job. From there, you could work full-time on growing your business. Or you could automate much of the work with AI, and hire a human manager to oversee the rest of it. Then you can retire if you like — regardless of your age. Justin Ramos, CEO of AI-powered Compai, recently went through this exercise himself. 'I was deciding between Wealthfront's S&P 500 Direct Investing and their Direct Index Investing offerings, and I asked Claude to analyze the long-term implications of both options, focusing on diversification and tax advantages. 'While the S&P 500 option had lower fees (0.09% vs 0.25%), Claude's analysis showed that Direct Index Investing's ability to harvest tax losses from individual stocks could generate an additional 1-2% in annual tax savings. It demonstrated that on a $100,000 initial investment growing at 8% annually for 20 years, the standard S&P 500 approach would yield approximately $466,000. Alternatively, the Direct Index approach would yield about $581,000, leaving me with $115,000 in additional retirement savings.' Ultimately, you're responsible for your own financial decisions. But AI can help you make more informed decisions — and perhaps retire with an extra $100,000 or more in your nest egg. More From GOBankingRates Surprising Items People Are Stocking Up On Before Tariff Pains Hit: Is It Smart? 5 Types of Cars Retirees Should Stay Away From Buying This article originally appeared on 5 Ways To Use AI To Earn and Save an Extra $100,000 for Retirement

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into the world of global news and events? Download our app today from your preferred app store and start exploring.
app-storeplay-store