logo
#

Latest news with #Pentaho

Why Data Fitness Is The Foundation For AI Success
Why Data Fitness Is The Foundation For AI Success

Forbes

time28-07-2025

  • Business
  • Forbes

Why Data Fitness Is The Foundation For AI Success

Maggie Laird is the President of Pentaho, Hitachi Vantara's data software business unit. We're entering a new era of enterprise automation, one where intelligent agents can analyze, decide and act independently. But while the industry's imagination races forward, infrastructure still lags behind. Ninety-three percent of enterprise IT leaders have implemented or plan to implement AI agents in the next two years, according to a recent survey by MuleSoft and Deloitte Digital cited by ZDNET. However, the survey also found that 95% of the IT leaders said they are struggling to integrate data across systems. Meanwhile, Gartner recently noted that, through 2025, 'poor data quality will persist as one of the most frequently mentioned challenges prohibiting advanced analytics (e.g., AI) deployment.' AI agents can only be as good as the data that feeds them. And most enterprise data environments are far from ready. They're fragmented. Opaque. Siloed by system, department, geography or format. Putting AI to work in those environments won't be transformative. That's automation built on sand. Enterprises are at a crossroads. Many are in the initial stages of deploying next-gen agents to boost productivity, streamline decision making and lower costs. But without a trusted, accessible and well-governed data foundation, those ambitions rest on shaky ground. AI agents don't just need data. They need data that is structured, contextualized, traceable and aligned to the organization's goals. Organizations must overcome a false sense of readiness. Think about what AI agents do: They generate signals, draw inferences and act. If they're trained on or use inconsistent or incomplete inputs, the decisions they make will reflect those flaws. While agents may move faster than humans, if they're working off half-truths or hidden assumptions, the consequences can quickly multiply. Thankfully, this isn't a technology gap. It's a data-readiness gap. And it needs executive-level attention. Over the past 20 years, my company has worked with some of the most data-intensive organizations in the world, from global banks and national defense systems to airlines, telecoms and critical infrastructure providers. Across these environments, four themes consistently separate those who scale well from those who stall and will be essential to an agentic world: 1. Build a unified, queryable data catalog. Most enterprises don't have a clear inventory of what data they have, where it lives or how it relates. A living, searchable catalog makes it possible for both humans and machines to understand and access what is available and use it responsibly. Organizations need to catalog both structured data and the rapidly growing pool of unstructured data, which makes up 80% to 90% of data today. Unstructured data is everything from videos to sales presentations to emails to social media posts that provide context for decisions. While this data will still likely live in different silos, a catalog is foundational to creating the data products that AI agents need to get a full picture of the business challenge they are addressing. 2. Operationalize data governance. Governance isn't about limiting access; it's about enabling trusted use. Yes, AI agents must know what data is relevant. But agents also must know how it was processed and what rules apply to its use. Think lineage, version control and explainability as baseline requirements. Every agent needs to know where the data came from, when it was changed, if it was changed, by whom and for what reason. For example, data may be scrubbed so voraciously that a person's middle initial gets removed in one dataset but not others. That can confuse an AI agent when, for example, gathering intelligence on whether a certain loan product would be good for a particular person. 3. Apply intelligent access controls. The data democratization that will drive agents doesn't mean letting every AI workload touch every dataset. It means giving the right agents access to the right data, under the right guardrails, for the right purposes. This requires policies that adapt to roles, risk levels and regulations. 4. Design pipelines for business relevance and scale. A smart pipeline isn't just fast; it's aligned. Different agents will need different datasets, formats, and levels of latency depending on the task. For instance, AI agents working on procurement will want and need access to different datasets than AI agents working on marketing. Once agents are working, monitor their success—or lack thereof—and make necessary changes in the data, the data pipelines and the access and control policies. Build data flows that can evolve with business needs and AI capabilities. Readiness is an ongoing practice, not a one-time project. Being 'ready for AI' isn't a one-time certification. It's a commitment to continuously refining how your organization collects, manages and mobilizes information. The companies that win with AI won't be the ones who adopt the most tools. They'll be the ones who do the most with what they already know and ensure their intelligence systems are built not just to automate, but to align. Make space for the agents of change, but first, build the conditions they need to succeed. AI is moving. Fast. The quality of your data will determine whether you're leading or watching from behind. Forbes Technology Council is an invitation-only community for world-class CIOs, CTOs and technology executives. Do I qualify?

Pentaho Releases Significant Updates to Pentaho Data Catalog
Pentaho Releases Significant Updates to Pentaho Data Catalog

Yahoo

time10-06-2025

  • Business
  • Yahoo

Pentaho Releases Significant Updates to Pentaho Data Catalog

Pentaho's Enhanced Governance, Quality and Observability Capabilities Meet Growing Customer Demand for Automation and Visibility Around Core Operations and AI Efforts SANTA CLARA, Calif., June 10, 2025 /PRNewswire/ -- Pentaho, an industry leading data intelligence and integration platform utilized by 73% of the Fortune 100, today announced significant enhancements to Pentaho Data Catalog, designed to help organizations achieve the data fitness needed for the AI age by increasing quality, observability and trust in data. The new enhancements to Pentaho Data Catalog expand the capacity of customers like FirstBank and Lightbox to better understand and organize their data, observe how that data is being used, and use automation to improve its quality, governance, and trust. Building upon Pentaho's 20-year legacy as a category leader in data management, Pentaho's continued innovation is helping customers achieve foundational, AI-ready data intelligence while avoiding the heavy infrastructure burdens and slow time to value of competing solutions. "The need for strong data foundations has never been higher, and customers are looking for help across a whole range of issues. They want to improve the organization of data for operations and AI. They need better visibility into the "what and where" of data's lifecycle for quality, trust, and regulations. And they want to leverage automation to scale management with data while also increasing time to value," said Kunju Kashalikar, Product Management, Pentaho. "The latest enhancements to our catalog strike at the heart of these issues. We're excited to bring this continued innovation to our customers and look forward to helping them on their journey to data fitness." Pentaho Data Catalog – Policy Improvements, AI Model Management and Data Products The latest updates to Pentaho Data Catalog extend key capabilities that are becoming more critical in an increasingly complex data landscape being overtaken by AI and regulations. An enhanced Data Marketplace experience enables executives, business users and data scientists to more easily find curated and trusted data sets for daily and strategic efforts. Deeper integrations with Okta and Active Directory improve policy access and security measures, important when guard railing data's use in AI models. Creation of data products with prescribed quality and sensitivity characteristics. Data delivery to data points of use including Python IDE, ML Test and Deployment tools. Integration with model development for model governance increases visibility into how and where models are accessing data for both appropriate use and proactive governance. ML enhancements for data classification, including unstructured data, improve the ability to automate and scale how data is managed for expanding data ecosystems. Enhancements to data optimization and re-tiering for structured and unstructured data support the use cases of archiving, migration, and policy driven lifecycle management. While customers such as the State of Arizona are benefiting from the solution's robust automation, policy, governance and classification capabilities, Pentaho Data Catalog has also been receiving significant industry attention. Pentaho was named a Major Player in the IDC MarketScape: Worldwide Data Intelligence Platform Software 2024 Vendor Assessment (doc # US51467224, November 2024), along with being recognized recently by BigDATAwire (Pentaho Data Catalog - 2024 Reader's Choice, Best Big Data Product: Data Catalog /Security /Governance) and the Data Breakthrough Awards (Pentaho Data Catalog - Data Catalog Solution of the Year). About PentahoTrusted by more than 73% of the Fortune 100, Pentaho is an independent business unit of Hitachi, Ltd (TSE:6501) that helps businesses become data-fit and AI-ready so they can innovate quickly and operate with confidence. Pentaho simplifies data chaos in a rapidly changing data and regulatory landscape through its leading data intelligence and integration platform, which streamlines data discovery, availability, governance, and insights. Learn more at About Hitachi VantaraHitachi Vantara is transforming the way data fuels innovation. A wholly owned subsidiary of Hitachi Ltd., Hitachi Vantara provides the data foundation the world's leading innovators rely on. Through data storage, infrastructure systems, cloud management and digital expertise, the company helps customers build the foundation for sustainable business growth. To learn more, visit About Hitachi, drives Social Innovation Business, creating a sustainable society through the use of data and technology. We solve customers' and society's challenges with Lumada solutions leveraging IT, OT (Operational Technology) and products. Hitachi operates under the 3 business sectors of "Digital Systems & Services" – supporting our customers' digital transformation; "Green Energy & Mobility" – contributing to a decarbonized society through energy and railway systems, and "Connective Industries" – connecting products through digital technology to provide solutions in various industries. Driven by Digital, Green, and Innovation, we aim for growth through co-creation with our customers. The company's revenues as 3 sectors for fiscal year 2023 (ended March 31, 2024) totaled 8,564.3 billion yen, with 573 consolidated subsidiaries and approximately 270,000 employees worldwide. For more information on Hitachi, please visit the company's website at View original content to download multimedia: SOURCE Hitachi Vantara Sign in to access your portfolio

Getting AI-Ready In A Hybrid Data World
Getting AI-Ready In A Hybrid Data World

Forbes

time05-06-2025

  • Business
  • Forbes

Getting AI-Ready In A Hybrid Data World

Maggie Laird is the President of Pentaho, Hitachi Vantara's data software business unit. If AI is the future, then data is the terrain. And most businesses are hiking in flip-flops. We all feel the urgency. Executives know that AI has the potential to drive exponential gains in productivity, insight and market leadership. But inside most organizations, the reality looks a lot less like ChatGPT magic and more like stalled pilots, costly proof-of-concepts and growing technical debt. Gartner predicts that through 2025, 60% of AI projects will be dropped because they are 'unsupported by AI-ready data.' What's more, 63% of organizations either lack proper data management practices for AI or aren't sure whether they have them, according to Gartner. And it's not just about volume or speed. It's about trust, structure and the ability to bridge hybrid environments without losing fidelity or context. Adopting AI today is a lot like installing a high-performance kitchen in a house with faulty wiring and sagging floors: flashy, expensive and ultimately unsafe. That's because AI doesn't magically fix data problems—it amplifies them. Many of the core data challenges that AI surfaces are issues companies have faced for years. Most businesses have data scattered across multiple environments: in the cloud, on-premises or a mix of both. Take customer retention. A company may have six streams of data tied to improving it. Some of the data is structured in columns and rows. But other critical data is unstructured—buried in email, PDFs or training videos. Most companies have mountains of unstructured data that hasn't been labeled, making it inaccessible to AI or incompatible with structured data. If a company wants AI to detect customer retention issues or trends, it needs all the relevant data to build an accurate picture. Without it, insights are skewed and incomplete—and the resulting decisions may be wrong. Imagine setting an AI agent loose to fix customer retention problems without reliable data. Who knows what could happen? Air Canada, for example, was recently found liable for a chatbot that gave a passenger incorrect information. The airline argued that it shouldn't be held responsible for what the chatbot said, but the court disagreed. The bar has been raised. 'Good enough' is no longer acceptable. AI is often designed to operate without humans in the loop, which means errors can go undetected. To get the right and best results from AI, organizations need a strong data foundation—what I call 'data fitness.' Here are four key indicators that your organization is data-ready for AI: Most organizations don't. Data lives in the cloud, on-prem servers, Slack threads, old Excel files and more. Being data-fit means you've cataloged what matters, labeled what's useful and can locate the most current version of any given asset—structured or unstructured. Your platform should connect across hybrid environments without needing to copy or move the data. To streamline this, start by clearly defining the AI use case. That makes it easier to identify what data to inventory. AI shifts decision-making to more people who don't have 'data' in their job titles. An analyst might know to exclude 'test region X' or adjust for seasonal bias in a report. Your AI agent won't. Neither will the product manager using a low-code interface to generate pricing suggestions. If your data isn't clean, governed and context-aware, you risk making high-speed, AI-driven decisions based on flawed inputs. That's not just bad insights—it's a serious risk. Different problems require different speeds. Historical data might be enough to plan next quarter's staffing, but real-time data could be essential for adjusting a flash sale or spotting inventory shortfalls. AI-ready platforms must operate across batch, real time and streaming—sometimes all within the same use case. Getting data-fit isn't just about cleaning up for the sake of it. It's about knowing which AI use cases matter, which data is needed and how much effort it will take to make that data usable. Sometimes, the return isn't worth it. That's okay—clarity saves time. But in many cases, once the investment is made, follow-on projects accelerate. Readiness compounds. Future efforts don't start from zero. AI isn't a tech project any longer, it's a business imperative. But without a solid data foundation, the tools don't matter. A home kitchen remodel inevitably involves hard, messy work—so does good data management. AI just makes it more urgent than ever. Forbes Technology Council is an invitation-only community for world-class CIOs, CTOs and technology executives. Do I qualify?

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into a world of global content with local flavor? Download Daily8 app today from your preferred app store and start exploring.
app-storeplay-store