Arthur Open-Sources First Real-Time AI Evaluation Engine

Associated Press31-03-2025

Build. Experiment. Scale. Now With Open-Source AI Evaluation.
NEW YORK, March 31, 2025 /PRNewswire/ -- AI is evolving fast—but making it work at scale remains a challenge. Today, Arthur is launching the Arthur Engine, the first open-source, real-time AI evaluation engine designed to help teams monitor, debug, and improve Generative AI and traditional ML models. No black-box monitoring. No third-party dependencies. No data privacy risks. All for free.
Why Real-Time AI Evaluation Matters in 2025
As AI adoption grows, so do its risks. Without real-time evaluation, organizations face:
Data leaks— 8.5% of employee prompts contain sensitive data (Harmonic Security).
Model degradation— AI models drift over time without ongoing monitoring.
Debugging nightmares – Slow iteration cycles lead to poor model performance.
The Arthur Engine solves these challenges by providing instant visibility, real-time guardrails, and on-the-fly model optimization—right inside your own environment.
'AI is moving fast, and we need to ensure it moves in the right direction. Open-sourcing the Arthur Engine puts powerful AI evaluation tools into the hands of developers, researchers, and builders worldwide.'
— Ashley Nader, Lead AI PM at Arthur
What Makes Arthur Engine Different?
Unlike traditional AI monitoring tools, Arthur Engine runs locally—preserving data sovereignty and eliminating compliance risks.
Real-Time AI Evaluation – Instantly detect failures before they impact production.
Active Guardrails – Intervene in real-time to prevent hallucinations and bad outputs.
Customizable Metrics – Tailor evaluations to your specific AI use case.
Privacy-Preserving & Secure – Keep all data inside your infrastructure.
Works Across All Models – Supports GPT, Claude, Gemini, open weights models, and traditional ML.
'By open-sourcing Arthur Engine, we're making AI trust and safety accessible to all developers—allowing them to safeguard AI systems with fully customizable, high-performance monitoring tools.'
— Cherie Xu, Technical Lead, Machine Learning at Arthur
AI Evaluation, Built for the Future
The Arthur Engine is part of Arthur's broader AI performance monitoring suite, designed to help organizations:
Validate AI outputs in real time
Detect performance shifts before they become problems
Ensure regulatory compliance and explainability
This open-source release marks a new standard in AI transparency, security, and performance monitoring.
AI is reshaping the world—let's make sure it performs the way it should.
About Arthur
Arthur is the leading AI performance company, empowering organizations to monitor, measure, and improve machine learning and generative AI models at scale. Designed for trust, accuracy, and efficiency, Arthur helps organizations optimize AI performance with real-time insights, proactive model monitoring, and cutting-edge guardrails.
Backed by a research-led approach, Arthur delivers exclusive capabilities that enable teams to build, deploy and scale AI with confidence.
Founded in 2019, Arthur has raised over $60M in venture funding from Index Ventures, Acrew Capital, Greycroft, Work-Bench, and other top investors.

Hashtags

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

The Coffee Bean & Tea Leaf® Kicks Off Summer with Bold New Ice Blended® Drink Menu and Refreshing Ube Coconut Lineup

Yahoo

26 minutes ago

Yahoo

The Coffee Bean & Tea Leaf® Kicks Off Summer with Bold New Ice Blended® Drink Menu and Refreshing Ube Coconut Lineup

The creators of the original Ice Blended® drink celebrate summer with nostalgic favorites, new modern flavors, and can't-miss, weekly promotions LOS ANGELES, June 6, 2025 /PRNewswire/ -- This summer, The Coffee Bean & Tea Leaf® is taking you on a flavor journey with a menu that blends nostalgia and innovation. Now through August 19, 2025, we're celebrating our legacy as the pioneers of the Ice Blended® drink, with a fresh lineup of drinks that's bound to become your new summer obsession. It all started in 1987, when a barista at Coffee Bean & Tea Leaf crafted the very first Ice Blended® drink, changing the world of coffee forever. Fast forward nearly four decades, and we're still leading the charge with that same spirit of creativity and passion. This summer, we're bringing back the classics—like the Ultimate Cold Brew Ice Blended® drink and CaraMocha Ice Blended® drink—and introducing bold new flavors that perfectly capture the essence of sunny California days. "We're thrilled to continue our legacy as tastemakers while constantly pushing the envelope with new and exciting flavor experiences," said Dee Hadley, Head of Marketing, Americas at The Coffee Bean & Tea Leaf. "We didn't just create the Ice Blended® drink, we started a movement. And now, we're inviting our guests to enjoy the flavors that made us who we are, alongside fresh, seasonal creations that are sure to make summer unforgettable." The summer lineup is all about celebrating the best of both worlds: the time-tested favorites that made us famous, and the bold new twists that keep things exciting. So, whether you're a long-time fan or new to the scene, there's something for everyone to savor. Get ready for a few surprises: The three new Ice Blended® beverages are a delicious mix of the classic and the contemporary, ensuring that each sip is as refreshing as a cool breeze on a hot Los Angeles day. Ready to experience summer in a cup? Head to your local cafe for the three new, bold Ice Blended beverages that build on Coffee Bean and Tea Leaf's legacy as a tastemaker of flavor innovation. The beverages include: Ultimate Cold Brew Ice Blended® Drink: Fan-favorite chocolate-covered espresso beans return in the new Ultimate Cold Brew Ice Blended® drink, delivering bold flavor and unbeatable texture. Finished with chocolate drizzle and whipped cream, it's the perfect caffeinated kick for summer. Want to take it up a notch? Make any drink Ultimate by adding chocolate-covered espresso beans. CaraMocha Ice Blended® Drink: A decadent remix of the beloved Mocha Ice Blended® drink, this version adds rich caramel blended in and drizzled throughout—a delightful evolution of a time-tested favorite. Pure CaraMocha Ice Blended® Drink: A twist on our popular Caramel Ice Blended, made with chocolate powder instead of vanilla for an indulgent, chocolatey sweet treat! Ube Coconut Affogato Ice Blended® Drink: Bright, tropical, and richly creamy, this seasonal sensation blends ube and coconut, crowned with a shot of espresso and a swirl of whipped cream. For those craving a lighter yet equally flavorful refreshment, Coffee Bean & Tea Leaf's new lineup of Ube Coconut beverages also includes the Iced Ube Coconut Cream Vanilla Latte with Boba, Iced Ube Coconut Cream Matcha Latte and Ube Coconut Cream Cap Modifier, for a tropical touch to any drink. To celebrate Coffee Bean & Tea Leaf's tastemaker history, the brand is rolling out exciting promotions all summer long: National Ice Blended Day (June 11): Guests can enjoy a $3 small Ice Blended® drink from 12–6 PM. Throwback Thursdays (June 19 - August 14): Every Thursday, from 12–6 PM, guests can grab a Regular Mocha or Vanilla Ice Blended® drink for just $4 (limit 2 per transaction; paid modifiers not included). Whether you're re-experiencing the Original Ice Blended® drink or trying a tropical twist for the first time, this summer is all about flavor, fun, and frozen memories at Coffee Bean & Tea Leaf. For more information about the new beverages, please visit About The Coffee Bean & Tea Leaf® Founded in Southern California in 1963, The Coffee Bean & Tea Leaf® is a global specialty coffee and tea house that inspires new experiences through our flavors from around the world. We source the finest coffees and teas from local communities and then handcraft every beverage to bring out the freshest flavors. As the creator of The Original Ice Blended®, we continue to innovate to enable people everywhere to enjoy the classics as well as new flavors both in our cafés and at home. Headquartered in Asia and a business of the Jollibee Group of Companies, The Coffee Bean & Tea Leaf passionately operates in more than 1,100 locations, across over 20 countries. For more information, visit View original content to download multimedia: SOURCE The Coffee Bean & Tea Leaf

Google's AI is ‘hallucinating,' spreading dangerous info — including a suggestion to add glue to pizza sauce

New York Post

28 minutes ago

New York Post

Google's AI is ‘hallucinating,' spreading dangerous info — including a suggestion to add glue to pizza sauce

Google's AI Overviews, designed to give quick answers to search queries, reportedly spits out 'hallucinations' of bogus information and undercuts publishers by pulling users away from traditional links. The Big Tech giant — which landed in hot water last year after releasing a 'woke' AI tool that generated images of female Popes and black Vikings — has drawn criticism for providing false and sometimes dangerous advice in its summaries, according to The Times of London. 3 Google's latest artificial intelligence tool which is designed to give quick answers to search queries is facing criticism. Google CEO Sundar Pichai is pictured. AFP via Getty Images In one case, AI Overviews advised adding glue to pizza sauce to help cheese stick better, the outlet reported. In another, it described a fake phrase — 'You can't lick a badger twice' — as a legitimate idiom. The hallucinations, as computer scientists call them, are compounded by the AI tool diminishing the visibility of reputable sources. Instead of directing users straight to websites, it summarizes information from search results and presents its own AI-generated answer along with a few links. Laurence O'Toole, founder of the analytics firm Authoritas, studied the impact of the tool and found that click-through rates to publisher websites drop by 40%–60% when AI Overviews are shown. 'While these were generally for queries that people don't commonly do, it highlighted some specific areas that we needed to improve,' Liz Reid, Google's head of Search, told The Times in response to the glue-on-pizza incident. 3 Google AI Mode is an experimental mode utilizing artificial intelligence and large language models to process Google search queries. Gado via Getty Images The Post has sought comment from Google. AI Overviews was introduced last summer and powered by Google's Gemini language model, a system similar to OpenAI's ChatGPT. Despite public concerns, Google CEO Sundar Pichai has defended the tool in an interview with The Verge, stating that it helps users discover a broader range of information sources. 'Over the last year, it's clear to us that the breadth of area we are sending people to is increasing … we are definitely sending traffic to a wider range of sources and publishers,' he said. Google appears to downplay its own hallucination rate. When a journalist searched Google for information on how often its AI gets things wrong, the AI response claimed hallucination rates between 0.7% and 1.3%. 3 Google's AI Overviews, was introduced last summer and is powered by the Gemini language model, a system similar to ChatGPT. AP However, data from the AI monitoring platform Hugging Face indicated that the actual rate for the latest Gemini model is 1.8%. Google's AI models also seem to offer pre-programmed defenses of their own behavior. In response to whether AI 'steals' artwork, the tool said it 'doesn't steal art in the traditional sense.' When asked if people should be scared of AI, the tool walked through some common concerns before concluding that 'fear might be overblown.' Some experts worry that as generative AI systems become more complex, they're also becoming more prone to mistakes — and even their creators can't fully explain why. The concerns over hallucinations go beyond Google. OpenAI recently admitted that its newest models, known as o3 and o4-mini, hallucinate even more frequently than earlier versions. Internal testing showed o3 made up information in 33% of cases, while o4-mini did so 48% of the time, particularly when answering questions about real people.

Google's AI Mode Now Creates Interactive Stock Charts For You

CNET

39 minutes ago

CNET

Google's AI Mode Now Creates Interactive Stock Charts For You

Google's AI Mode now can create interactive charts when users ask questions about stocks and mutual funds, the company said in a blog post Thursday. Users might ask the site to compare five years of stock performances for the biggest tech companies, or request to see mutual funds with the best rates of return over the past decade. Gemini, Google's AI engine, will then create an interactive graph and comprehensive explanation. I created a sample by going to the webpage for the new experiment (if you try it at work, you might learn that your admin has banned it, but it should work on a personal computer). Once there, I told it exactly what I wanted, "make me an interactive chart showing MSFT stock over the past five years." It produced the chart, and I was able to move the slider from one date to another, showing the stock price on that date. It's the same kind of chart you can probably get at your financial advisor's site, but it did work. Tell Google AI what chart you want, and it will create one that you can interact with. CNET But be warned: AI has accuracy issues, and users need to be extra-careful with financial information of any kind. "AI has historically struggled with quantitative reasoning tasks," said Sam Taube, lead investing writer at personal finance company NerdWallet. "It looks like Google's AI mode will often provide links to its data sources when answering financial queries. It's probably worth clicking those links to make sure that the raw data does, in fact, match the AI's output. If there's no link to a data source, proceed with caution; consider manually double-checking any numbers in the AI's output. I wouldn't trust any AI model to do its own math yet." The feature is a new experiment from Google Labs. At its I/O conference last month, Google announced AI Mode's ability to create interactive graphics for complex sets of data. The feature is now only for queries about stocks and mutual funds, but it will be expanded to other topics eventually. "I'd avoid asking AI any 'should I invest in XYZ' type questions," Taube told CNET. "The top AI models may be smart, but they aren't credentialed financial advisors and probably don't know enough about your personal financial situation to give good advice. What's more, AI doesn't have a great track record at picking investments, at least so far. Many AI-powered ETFs (funds that use AI to pick stocks to invest in) are underperforming the S&P 500 this year."