
Latest news with #FRAMES

PromptQL Partners with UC Berkeley to Develop New Data Agent Benchmark for Reliability of Enterprise AI Agents

Business Upturn

June 4, 2025


BERKELEY, Calif., June 04, 2025 (GLOBE NEWSWIRE) -- PromptQL, a platform for reliable AI, today announced a strategic research collaboration with the University of California, Berkeley to develop the first comprehensive data agent benchmark for enterprise reliability, designed specifically to evaluate general-purpose AI data agents in enterprise environments.

A recent McKinsey study revealed that 78% of organizations use AI in at least one business function; however, more than 80% say their organization hasn't seen a tangible impact on enterprise-level Earnings Before Interest and Taxes (EBIT). The partnership, led by Aditya Parameswaran, Professor and Co-Director of UC Berkeley's EPIC Data Lab, along with his students, addresses this fundamental challenge organizations face when deploying AI systems in business-critical environments.

While existing agentic data benchmarks like GAIA, Spider, and FRAMES test specific AI tasks, they overlook the complexity, reliability demands, and messy, siloed data that define real business environments. The forthcoming data agent benchmark aims to offer a solution by creating a framework that reflects real-world complexities.

"Our customer conversations reveal a clear pattern: they're ready to move from proof-of-concepts to production AI, yet they lack the evaluation tools to make confident deployment decisions," said Tanmai Gopal, CEO of PromptQL. "The data agent benchmark changes that by using representative datasets from our work in telecom, healthcare, finance, retail, and anti-money laundering to reflect the real complexity of enterprise AI."

UC Berkeley's EPIC Data Lab brings expertise to this collaboration. Professor Parameswaran is a leading authority on the use of AI for next-gen usable data analysis tools and has received numerous prestigious awards. His research group has created widely adopted data tools with tens of millions of downloads.

"Current benchmarks suffer from what I call the '1% problem': they're built for tech giants and ignore the 99% of organizations grappling with real-world data complexity," Parameswaran said. "The data agent benchmark marks a shift toward evaluating AI based on the reliability, transparency, and practical value enterprises actually need. This collaboration bridges academic rigor with the production insights PromptQL brings from real deployments."

The data agent benchmark beta will be revealed later this year. Organizations interested in early access or in contributing use cases or datasets can reach out to the research team at [email protected].

PromptQL will be at AI Engineer World's Fair, June 3-6 in San Francisco. Tanmai Gopal, PromptQL's co-founder and CEO, will present a session, "AI Automation that Actually Works: $100M Impact on Messy Data with Zero Surprises," on June 4 at 11:15 a.m. PT. To learn more or schedule a demo at the PromptQL booth, visit .

About PromptQL

PromptQL is a next-generation AI platform from the makers of Hasura, the company behind the pioneering GraphQL Engine. Built for enterprise-grade reliability, PromptQL enables natural language analysis and automation on internal business data, with an industry-first accuracy SLA. By learning the unique language of your business and planning tasks before executing them deterministically, PromptQL brings human-level precision to AI agents.

About UC Berkeley EPIC Data Lab

The EPIC Data Lab at UC Berkeley develops low-code and no-code interfaces for data work, powered by Gen AI. Co-led by Professor Aditya Parameswaran, the lab follows Berkeley's tradition of multidisciplinary systems research with emphasis on real-world impact and practical deployment. The lab's tools, including DocETL and other widely adopted systems, demonstrate Berkeley's leadership in democratizing data science capabilities.

Media Contact:
Erica Anderson
Offleash for PromptQL
[email protected]

PromptQL Partners with UC Berkeley to Develop New Data Agent Benchmark for Reliability of Enterprise AI Agents

Yahoo

June 4, 2025


New benchmark to address critical gap in evaluating AI systems for mission-critical business operations

BERKELEY, Calif., June 04, 2025 (GLOBE NEWSWIRE) -- PromptQL, a platform for reliable AI, today announced a strategic research collaboration with the University of California, Berkeley to develop the first comprehensive data agent benchmark for enterprise reliability, designed specifically to evaluate general-purpose AI data agents in enterprise environments.

A recent McKinsey study revealed that 78% of organizations use AI in at least one business function; however, more than 80% say their organization hasn't seen a tangible impact on enterprise-level Earnings Before Interest and Taxes (EBIT). The partnership, led by Aditya Parameswaran, Professor and Co-Director of UC Berkeley's EPIC Data Lab, along with his students, addresses this fundamental challenge organizations face when deploying AI systems in business-critical environments.

While existing agentic data benchmarks like GAIA, Spider, and FRAMES test specific AI tasks, they overlook the complexity, reliability demands, and messy, siloed data that define real business environments. The forthcoming data agent benchmark aims to offer a solution by creating a framework that reflects real-world complexities.

"Our customer conversations reveal a clear pattern: they're ready to move from proof-of-concepts to production AI, yet they lack the evaluation tools to make confident deployment decisions," said Tanmai Gopal, CEO of PromptQL. "The data agent benchmark changes that by using representative datasets from our work in telecom, healthcare, finance, retail, and anti-money laundering to reflect the real complexity of enterprise AI."

UC Berkeley's EPIC Data Lab brings expertise to this collaboration. Professor Parameswaran is a leading authority on the use of AI for next-gen usable data analysis tools and has received numerous prestigious awards. His research group has created widely adopted data tools with tens of millions of downloads.

"Current benchmarks suffer from what I call the '1% problem': they're built for tech giants and ignore the 99% of organizations grappling with real-world data complexity," Parameswaran said. "The data agent benchmark marks a shift toward evaluating AI based on the reliability, transparency, and practical value enterprises actually need. This collaboration bridges academic rigor with the production insights PromptQL brings from real deployments."

The data agent benchmark beta will be revealed later this year. Organizations interested in early access or in contributing use cases or datasets can reach out to the research team at epic-support@

PromptQL will be at AI Engineer World's Fair, June 3-6 in San Francisco. Tanmai Gopal, PromptQL's co-founder and CEO, will present a session, "AI Automation that Actually Works: $100M Impact on Messy Data with Zero Surprises," on June 4 at 11:15 a.m. PT. To learn more or schedule a demo at the PromptQL booth, visit .

About PromptQL

PromptQL is a next-generation AI platform from the makers of Hasura, the company behind the pioneering GraphQL Engine. Built for enterprise-grade reliability, PromptQL enables natural language analysis and automation on internal business data, with an industry-first accuracy SLA. By learning the unique language of your business and planning tasks before executing them deterministically, PromptQL brings human-level precision to AI agents.

About UC Berkeley EPIC Data Lab

The EPIC Data Lab at UC Berkeley develops low-code and no-code interfaces for data work, powered by Gen AI. Co-led by Professor Aditya Parameswaran, the lab follows Berkeley's tradition of multidisciplinary systems research with emphasis on real-world impact and practical deployment. The lab's tools, including DocETL and other widely adopted systems, demonstrate Berkeley's leadership in democratizing data science capabilities.

Media Contact:
Erica Anderson
Offleash for PromptQL
promptql@

Research Contact:
Professor Aditya Parameswaran
UC Berkeley EPIC Data Lab
epic-support@

You.com Introduces ARI Enterprise, The Most Accurate AI Deep Research Platform That Unifies Web, Internal, and Premium Data Sources to Deliver Strategic Intelligence

Associated Press

May 15, 2025


PALO ALTO, Calif.--(BUSINESS WIRE)--May 15, 2025-- You.com, a pioneer in agentic research and the leading AI productivity engine for business, today introduced ARI Enterprise, an AI-powered deep research platform that empowers consultants, financial analysts, and researchers with the confidence to accelerate strategic decisions in an increasingly complex business climate.

ARI Enterprise leverages the Advanced Research and Insights (ARI) agent to analyze all critical data sources (internal documents and data, web data, and premium databases) with unmatched depth and accuracy, delivering insights through fully customizable and visually rich reports. This comprehensive approach eliminates the intelligence gaps that plague every organization. ARI Enterprise is available today at .

"The best AI analysts and researchers connect company internal knowledge with the best information on the web. Making both useful is critical for getting the right answer," said Richard Socher, CEO and co-founder of You.com. "ARI Enterprise represents a paradigm shift from periodic, expensive research projects to continuous, trusted strategic intelligence. By giving analysts, consultants and other knowledge workers complete access to all critical data sources and the most accurate insights in the industry, we're eliminating the uncertainty that undermines strategic decision making."

Initial testing revealed that ARI delivered greater accuracy than comparable solutions, a decisive advantage when business-critical decisions hang in the balance. On a benchmark of complex consultant/investment research questions, ARI beat OpenAI's Deep Research three out of four times, with a 76% overall win rate. Further, in a FRAMES benchmark study modified for deep research, ARI scored 80% accuracy, the best known performance of any AI model in this study. It outperformed models from OpenAI, Perplexity, and others, showcasing ARI's superior capabilities in retrieval, web search, and reasoning.

ARI Enterprise closes critical intelligence gaps through four key capabilities:

"At the NIH Office of Portfolio Analysis, we're constantly seeking ways to enhance our research capabilities across a diverse team of PhD biomedical scientists, developers, and research staff," said Chuck Lynch, Director of IT Resources & Security, National Institutes of Health. "After a thorough assessment of use cases, we're evaluating ARI agent for its versatility and focus on research accuracy, and it's already benefiting our premier biomedical science analytics group in assessing and synthesizing grant research more efficiently."

"Early trials have shown that ARI has enhanced our research process, enabling us to analyze media landscapes and create more informed strategies in a fraction of the time. The ability to connect our internal knowledge and expertise with comprehensive external information gives our teams a significant advantage when crafting client communications. What sets ARI apart is how it fits into our existing workflow - from initial research to final client deliverables - while maintaining the source verification that's critical in our field. For a global firm handling multiple complex client issues daily, this capability is invaluable," said Philip Fraser, Chief Information Officer at APCO.

"We look forward to working with You.com as they launch ARI Enterprise," said Jeff Mullen, Partner at WestCap. "Our investment research process requires synthesizing large volumes of data from diverse sources, and we sought an AI-enabled solution to accelerate how we uncover new research and market thematics. WestCap will serve as a technology-forward design partner, working closely with the team to build out ARI."

You.com is also launching an exclusive ARI Customer Advisory Board for select customers, offering early access to upcoming features and the opportunity to shape future development. Interested enterprise customers can apply at .

About You.com

You.com is an early pioneer in agentic research and the maker of the leading AI productivity engine for enterprises. AI Agents maximize the productivity of knowledge workers through fast and accurate research and analysis, complex problem solving, content creation, and more. The company's suite of APIs and end-to-end solutions drive revenue for businesses by becoming the foundation AI agent layer for their products and services. Founded by leading AI research scientists Richard Socher and Bryan McCann, You.com has raised $99 million from Marc Benioff's Time Ventures, Salesforce Ventures, NVIDIA, SBVA, Georgian Ventures, Radical Ventures, Day One Ventures, Breyer Capital, Norwest Venture Partners, DuckDuckGo and others.

Copyright Business Wire 2025.
