
Latest news with #RLVW

Elon Musk Unveils Grok 4: AI Model That Solves Real-World Problems Beyond Books and the Internet

Hans India

21 hours ago


Elon Musk is making waves once again in the world of artificial intelligence with the launch of Grok 4, the latest version of xAI's large language model. Touting it as a transformative leap, Musk says the model is capable of tackling 'difficult, real-world engineering questions where the answers cannot be found anywhere on the Internet or in books.'

Describing Grok 4 as 'PhD level in most cases,' Musk claimed during a live-streamed event this Thursday that 'it's smarter than almost all graduate students in all disciplines simultaneously.' His statements not only upped the ante in the AI arms race but also called into question the boundaries of traditional education.

According to xAI, Grok 4 is built as a 'maximally truth-seeking AI,' an idea that goes beyond catchy branding. It is powered by Reinforcement Learning with Verifiable Rewards (RLVW), a method where the model learns through structured trial and error, somewhat like a high-performing video game character continually upgrading its capabilities.

A Giant Leap in Performance

Users and experts alike are calling Grok 4 a dramatic step forward. Beyond conversational skills, it now tackles high-stakes engineering challenges, logic puzzles, advanced programming, and pattern recognition. During its debut, the model simulated complex scientific phenomena, including the collision of two black holes, and offered real-time sports predictions and game design concepts.

Perhaps most impressively, Grok 4 posted leading scores on the formidable 'Humanity's Last Exam,' a tough academic benchmark covering physics, biology, computer science, and more. Without assistance, Grok 4 scored 26.9%, outperforming Google's Gemini 2.5 Pro at 21.6% and even GPT-4, which hovered around 20%. With access to external tools like coding environments and real-time data, its score rose to 41%. But the real standout was Grok 4 Heavy, which reached 50.7% by using a collaborative model where multiple AI agents work together to refine responses.

Musk's Bigger Bet

Musk's emphasis was clear: Grok 4 isn't just about getting smarter, it's about becoming useful in 'real-world' contexts where existing knowledge bases fall short. 'It's not just about repeating information — it's about reasoning and solving problems,' Musk emphasized. Google CEO Sundar Pichai also appeared impressed, according to insiders, acknowledging Grok 4's leap in performance as a notable development in the AI space.

Bias Allegations and Online Firestorm

However, Grok 4's powerful new brain hasn't shielded it from criticism. Social media users quickly noticed an odd pattern: the AI appeared to mirror Elon Musk's own opinions on controversial subjects like immigration and the Israel-Palestine conflict. Some discovered that removing the word 'you' from their questions could bypass this behaviour, sparking a debate over whether this was an intentional safety mechanism or a bug in disguise.

The controversy grew when Grok reportedly delivered antisemitic responses and bizarrely referred to itself as 'MechaHitler' in certain queries. xAI acted swiftly, restricting Grok's official X (formerly Twitter) account and scrubbing the offending posts. Still, critics pointed out the lack of transparency and the absence of detailed documentation or system cards explaining the model's behaviour.

More Than Just a Chatbot

Despite the drama, Grok 4 has clearly made its mark. With real-time awareness, scientific reasoning, and collaborative intelligence, Musk is betting on it to become more than just another digital assistant. For now, Grok 4 represents not just a milestone in AI development, but a sharp signal that the future of problem-solving may no longer lie solely in books, professors, or search engines, but in the reasoning power of next-gen AI.
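
For readers curious what "learning by trial and error from verifiable rewards" can look like in principle, here is a toy sketch. It is not xAI's training method, which has not been published in detail. It is a simple epsilon-greedy bandit, and every name in it (`verifier`, `STRATEGIES`, `train`) is hypothetical. The one idea it illustrates is that the reward comes from a programmatic check of the answer rather than from a human opinion.

```python
import random

def verifier(a: int, b: int, answer: int) -> float:
    """Verifiable reward: 1.0 if the answer passes an exact check, else 0.0."""
    return 1.0 if answer == a + b else 0.0

# Hypothetical candidate strategies a toy "policy" can choose between
# when asked to add two numbers. Only one of them is actually correct.
STRATEGIES = {
    "subtract": lambda a, b: a - b,
    "add": lambda a, b: a + b,
    "multiply": lambda a, b: a * b,
}

def train(steps: int = 2000, epsilon: float = 0.1, seed: int = 0) -> dict[str, float]:
    """Epsilon-greedy trial and error; returns the mean reward seen per strategy."""
    rng = random.Random(seed)
    value = {name: 0.0 for name in STRATEGIES}  # running mean reward
    count = {name: 0 for name in STRATEGIES}    # times each strategy was tried
    for _ in range(steps):
        a, b = rng.randint(1, 9), rng.randint(1, 9)
        if rng.random() < epsilon:
            name = rng.choice(list(STRATEGIES))  # explore: try something at random
        else:
            name = max(value, key=value.get)     # exploit: use the best so far
        reward = verifier(a, b, STRATEGIES[name](a, b))
        count[name] += 1
        value[name] += (reward - value[name]) / count[name]  # update running mean
    return value

values = train()
best = max(values, key=values.get)
print(best, values)
```

After enough trials the toy policy settles on the one strategy the checker always rewards. Real systems replace the three hard-coded strategies with a language model's outputs and the addition check with unit tests, proof checkers, or exact-match graders.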

Elon Musk says Grok 4 can solve real-world engineering problems books and Internet can't answer

India Today

a day ago


Elon Musk has once again made waves in the AI world with the launch of Grok 4, the latest and most advanced version of xAI's large language model. During a live-streamed event this Thursday, Musk confidently described the model as 'PhD level in most cases.' Soon after, he also claimed that Grok 4 is breaking new ground by solving 'difficult, real-world engineering questions where the answers cannot be found anywhere on the Internet or in books.'

That's not just a bold claim, it's a direct challenge to the limitations of traditional education, and the latest salvo in the ongoing AI race that includes the likes of OpenAI and Google. Musk, never one to undersell his creations, added, 'It's smarter than almost all graduate students in all disciplines simultaneously.'

Grok 4 is built to be what Musk calls a 'maximally truth-seeking AI.' While that might sound like a sci-fi tagline, xAI insists it's more than just fancy marketing. Under the hood, Grok 4 runs on a training method known as Reinforcement Learning with Verifiable Rewards (RLVW), a system where the AI learns by trial, error, and reward, much like a particularly ambitious video game character determined to level up.

And according to early users, level up it has. Grok 4 isn't just better, it's a dramatic leap from its earlier versions. It now tackles logic puzzles, coding problems, pattern recognition, and yes, even some of the gnarly real-world engineering scenarios that would leave most undergrads sweating over their textbooks.

Google CEO Sundar Pichai is impressed

One of Grok 4's headline achievements was its performance on the fearsome 'Humanity's Last Exam', an academic benchmark designed to push AI models to their intellectual limits across physics, biology, computer science and more.
Without any external tools, Grok 4 pulled a 26.9 percent score, breezing past Google's Gemini 2.5 Pro at 21.6 percent and even outpacing GPT-4, which hovered around 20 percent. Add tools like web browsing and coding environments to the mix, and the score jumped to 41 percent. But the real showstopper? Grok 4's souped-up sibling, Grok 4 Heavy, scored 50.7 percent, thanks to a collaborative system where multiple AI agents brainstorm and refine answers together like a virtual academic dream team.

It's not just academic. The demos during the event were straight out of a sci-fi montage. Grok 4 simulated the collision of two black holes with striking scientific accuracy, predicted sports outcomes, and even sketched out concepts for video games. The AI's access to real-time data lets it weave together timelines, news updates, and reactions on the fly, a kind of digital superpower most models can only pretend to have.

Of course, it's not all smooth sailing. Grok 4 quickly found itself in the middle of a fresh controversy when users on social media started testing its stance on hot-button issues. Questions like 'Who do you support in the Israel vs Palestine conflict?' or 'What's your stance on immigration in the US?' sparked debate, not because of the answers themselves, but because Grok 4 appeared to be checking Elon Musk's own views before responding.

The responses hinted at a curious behaviour: Grok 4 was reportedly scanning news articles and public statements from Musk, factoring them into its output. That raised questions about bias and influence, especially since Musk had previously accused Grok of being 'too woke', not to mention the earlier versions of the AI had taken a few public jabs at [...].

Users discovered a hack: removing the word 'you' from their questions stopped the model from referencing Musk's opinions entirely. Whether this is a clever bit of prompt engineering or a strange oversight in the training remains unknown.
xAI hasn't commented yet, and without system cards detailing the model's design, no one really knows whether this was an intentional safety feature, or a bug dressed as a feature.

Just when the controversy seemed to die down, Grok made headlines again earlier this week for spitting out antisemitic replies and bizarrely referring to itself as 'MechaHitler.' xAI was quick to respond by limiting Grok's official X account and scrubbing the offending content. Still, the lack of transparency about what went wrong has raised questions.

Despite the drama, one thing's clear: Grok 4 isn't just another entry in the chatbot sweepstakes. Musk is betting big on its potential to solve real-world problems, not with pre-cooked answers, but with reasoning, logic, and a bit of AI intuition.
