logo
What Companies Should Know About Implementing A Data Lakehouse

What Companies Should Know About Implementing A Data Lakehouse

Forbesa day ago
Abhik Sengupta, Principal Solution Architect, Hakkoda.
Traditional data warehouses—once the backbone of business intelligence and reporting—are increasingly misaligned with today's data demands. The surge in data volume, velocity and variety has exposed their architectural constraints: rigid schemas, high storage costs, poor handling of semi-structured data, and reliance on batch-oriented extract, transform and load (ETL) processes.
In a Teradata/Vanson Bourne survey from 2018, 74% of decision-makers were already citing analytics complexity as a challenge with data warehouses, and 79% said users lacked access to all the data they needed. By 2021, DBTA reported that 88% of organizations struggled with data loading in these environments, and 42% still relied on manual cleanup and transformation.
These limitations are particularly problematic in cloud-native environments where real-time analytics, AI workloads and globally distributed teams demand flexibility and speed.
To overcome these challenges, many enterprises are adopting lakehouse architectures, which are intended to unify the governance and performance of data warehouses with the scalability and openness of data lakes.
As a principal solution architect, I've led several large-scale lakehouse implementations across platforms like Snowflake, Coalesce and Sigma. In this article, I'll explain how lakehouses can address legacy bottlenecks and what organizations should consider when modernizing their data platforms with this approach.
Why Companies Are Shifting To Lakehouse Architectures
At its core, a lakehouse stores structured, semi-structured and unstructured data in low-cost object storage while layering on transactional features, schema enforcement and version control through a metadata management layer. This enables organizations to build both batch and streaming data pipelines, maintain high data quality and support time travel and auditability within the same platform.
A major advantage of lakehouses is their interoperability. Multiple analytics and machine learning engines can access the same datasets simultaneously, eliminating the need for redundant copies or specialized infrastructure. This can improve collaboration across teams, speed up experimentation and simplify data governance.
By unifying ingestion, processing, analytics and AI workloads, lakehouses can reduce operational complexity while increasing agility. They can also provide a composable foundation for building domain-driven data products, enabling real-time personalization.
In fact, a study published earlier this year in Information Systems found that the lakehouse is "inexpensive, quick and adaptable" like a data lake, while combining the "structure and simplicity of a [data warehouse] with the broader use cases of a [data lake]."
From personal experience, in one project I worked on, onboarding time dropped by 40% due to reusable pipeline templates and declarative schema handling.
Importantly, built-in features like versioning and time travel enable data auditability, governance and lineage tracking using tools such as Great Expectations and CloudWatch. That said, it's important to consider which engines—such as Spark, Snowflake and Athena—are supported to enable flexible, future-ready analytics environments.
This will be particularly important as companies work to adopt AI. Unlike traditional data warehouses, lakehouses support diverse, large-scale datasets—including unstructured formats—within one repository. Versioning and snapshotting enable repeatable, auditable ML workflows. Support for Spark and Flink can allow scalable model training directly on fresh data, essential for real-time personalization and AI governance.
Technical Architecture: Building A Real-World Lakehouse Stack
Implementing a lakehouse architecture is a multiphase transformation that spans the full data life cycle, from ingestion to governance. It's not a one-size-fits-all deployment, but a set of strategic choices that must align with organizational priorities, technical maturity and interoperability needs:
1. Ingestion: This is the foundation, where teams must assess the nature of their data sources, expected latency and format diversity. Successful implementations typically use schema-aware tools that preserve metadata and support both batch and streaming pipelines to ensure consistency downstream.
2. Processing And Transformation: In this stage, raw data is converted into analytics- and ML-ready formats. Most lakehouse platforms support schema evolution, versioning and time-travel-like capabilities, allowing teams to build reproducible pipelines and accommodate changing data structures without data loss.
3. Implementing The Storage Layer: This typically uses cloud-native object stores (like S3, ADLS or GCS), with an open format and a metadata layer to manage immutability, partitioning and optimization. The goal is scalable, low-cost storage that enables fast access and governance at scale.
4. Query And Analytics: Lakehouses often support multi-engine interoperability, allowing business intelligence tools, SQL engines and ML frameworks to access the same governed datasets. Companies must catalog integration data and metadata consistently to ensure reliable performance and trusted insights.
5. Orchestration: Layers must accommodate schema evolution, rollback and modular pipelines. Most teams implement CI/CD for data workflows, using orchestration tools like Airflow, Dbt or Step Functions to ensure reproducibility and resilience.
6. Governance And Observability: Both of these functions should span the entire stack. Versioned metadata, data contracts, lineage tracking and quality testing tools (e.g., Great Expectations, Soda or Monte Carlo) play a central role in building trust and compliance across domains.
What It Takes To Prepare For The Lakehouse
Success with a lakehouse depends on more than just tooling—it requires team readiness, clear processes and thoughtful design. Organizations must build capabilities in schema evolution, cross-engine interoperability and performance tuning to meet latency and cost goals.
For compliance (e.g., GDPR, HIPAA, SOX), the architecture must support data lineage, time-based audits and immutability. This includes implementing version-controlled metadata, retention policies, role- and policy-based access controls, encryption (at rest and in transit) and detailed logging. Observability and data contracts are essential to detect quality issues before they become compliance risks.
Operationally, automation is key. Tasks like compaction, metadata cleanup and performance optimization must be built into workflows. While platform integration is improving, gaps remain in business intelligence and orchestration tools, making testing and validation critical.
Finally, readiness also depends on people. Invest in upskilling through structured training, reusable frameworks and real-world pilots. These accelerate adoption and reduce errors.
By addressing these concerns, companies can build a scalable lakehouse foundation—ready to support governed, high-performing data products and AI at enterprise scale.
Forbes Technology Council is an invitation-only community for world-class CIOs, CTOs and technology executives. Do I qualify?
Orange background

Try Our AI Features

Explore what Daily8 AI can do for you:

Comments

No comments yet...

Related Articles

Palantir Looks Like a Great Stock. Or Does It?
Palantir Looks Like a Great Stock. Or Does It?

Yahoo

time6 hours ago

  • Yahoo

Palantir Looks Like a Great Stock. Or Does It?

Key Points Palantir delivered record performance thanks to the rise of its commercial business. The stock is benefiting from a strong momentum. Despite the excitement, the stock trades at a rich valuation, leaving little room for error if growth slows. 10 stocks we like better than Palantir Technologies › Palantir Technologies (NASDAQ: PLTR) has been on a tear lately. The data analytics company has captured investor attention with record results, surging demand for its AI-driven platforms, and a stock price that's been on an impressive run. For many, the bullish case seems obvious: strong growth, expanding margins, and a technology stack that appears well positioned for the AI era. But as any seasoned investor knows, there's a difference between a great business and an excellent stock. Palantir may have the first part nailed -- but the second is where the story gets complicated. Let's unpack both sides of the argument. Palantir is firing on all cylinders Palantir's latest results blew past expectations. In the second quarter of 2025, revenue rose 48% year over year to $1 billion -- the highest in its history -- while adjusted operating income rose 24% to $464 million. What's driving this surge? One is that Palantir's commercial business is expanding rapidly as enterprises adopt its Artificial Intelligence Platform (AIP) for their AI transformation. For perspective, the AI company closed a record $803 million in U.S. commercial total contract value (TCV), representing a 134% year-over-year improvement in the latest quarter. But that's just part of the story. Palantir's traditional public sector business also delivered improvements, as U.S. government revenue increased 45% year over year in the same quarter. These twin engines of growth position the tech company well to sustain its momentum in the coming quarters, if not years, riding on the massive AI transformation wave. Momentum could keep the rally going Beyond fundamentals, Palantir is benefiting from a powerful momentum wave. The stock has gained more than 500% over the past year, and such rapid appreciation often attracts momentum traders and algorithmic funds. These investors aren't necessarily buying because they've done a deep dive into Palantir's financials -- they're buying because the trend is up. Momentum can be a powerful short-term force. In Palantir's case, several factors are aligning to keep investor enthusiasm high: A strong narrative around AI leadership. Expanding adoption across both commercial and government sectors. Upwardly revised guidance often keeps analysts and traders optimistic. This dynamic doesn't guarantee further gains, but it does mean that sentiment may continue to act as a tailwind in the coming month, especially if Palantir continues to deliver quarterly upside surprises. However, momentum is notoriously fickle. When it turns, the same traders who piled in can exit just as quickly, amplifying the downside. That's why long-term investors should be careful not to confuse a price surge with a permanent shift in value. The other side of the coin: valuation risk While the business story is compelling, the investment case becomes more complex when we factor in valuation. Palantir currently trades at over 132 times sales -- a premium multiple that incorporates a significant amount of future success, among other factors. History shows that even great companies can deliver mediocre returns if bought at overly optimistic valuations. Microsoft, for example, was a phenomenal business in the late 1990s -- yet investors who bought at the height of the dot-com boom had to wait more than a decade to break even, despite Microsoft's continued operational success. That's the risk here: Palantir could continue to win new contracts, expand margins, and deepen its AI moat -- yet still deliver underwhelming stock returns if today's valuation already incorporates too much optimism. Besides, paying a high premium for a stock leaves little margin for error. Just one quarter of weaker-than-expected performance can lead to a stock being rerated to a more reasonable valuation. Great for investors looking to buy, but a disaster for those who have purchased the stock at high prices. What does it mean for investors? Palantir's latest quarter showed record revenue and growing profits -- the kind of financial performance any growth investor would love to see. Furthermore, the company's AI platform is gaining traction, lending it a credible claim to being one of the most significant players in applied artificial intelligence. In the near term, momentum and investor enthusiasm could push the stock higher. But over the longer term, the most significant risk may not be the business itself -- it's the price investors are willing to pay for it today. While traders may be happy to ride the momentum, long-term investors, however, should think twice before making a bet. It will be prudent to wait for a more reasonable entry point. Should you buy stock in Palantir Technologies right now? Before you buy stock in Palantir Technologies, consider this: The Motley Fool Stock Advisor analyst team just identified what they believe are the for investors to buy now… and Palantir Technologies wasn't one of them. The 10 stocks that made the cut could produce monster returns in the coming years. Consider when Netflix made this list on December 17, 2004... if you invested $1,000 at the time of our recommendation, you'd have $649,544!* Or when Nvidia made this list on April 15, 2005... if you invested $1,000 at the time of our recommendation, you'd have $1,113,059!* Now, it's worth noting Stock Advisor's total average return is 1,062% — a market-crushing outperformance compared to 185% for the S&P 500. Don't miss out on the latest top 10 list, available when you join Stock Advisor. See the 10 stocks » *Stock Advisor returns as of August 13, 2025 Lawrence Nga has no position in any of the stocks mentioned. The Motley Fool has positions in and recommends Microsoft and Palantir Technologies. The Motley Fool recommends the following options: long January 2026 $395 calls on Microsoft and short January 2026 $405 calls on Microsoft. The Motley Fool has a disclosure policy. Palantir Looks Like a Great Stock. Or Does It? was originally published by The Motley Fool Sign in to access your portfolio

Six in 10 Transportation Management Leaders Believe Agentic AI Will Act on Their Behalf by 2030
Six in 10 Transportation Management Leaders Believe Agentic AI Will Act on Their Behalf by 2030

Yahoo

time17 hours ago

  • Yahoo

Six in 10 Transportation Management Leaders Believe Agentic AI Will Act on Their Behalf by 2030

Six in 10 transportation leaders believe autonomous agentic AI will act on their behalf within the next five years, but just 37 percent of surveyed organizations have 'deeply' integrated artificial intelligence and machine learning into their current transportation management systems (TMS). That's according to data from supply chain technology company Manhattan Associates and research firm Vanson Bourne. While transportation leaders seem to believe in the promise AI could bring to their organizations, they also acknowledge that there are adoption issues that could bar them from implementing agentic AI systems rapidly. Just under half of leaders surveyed said they're highly prepared for AI agents to be in place by 2030, but about half of organizations expect to face skill shortages while integrating the technology. Forty-four percent of leaders said they would be likely to face integration holdups and an additional 44 percent cited questions of data quality and availability. More from Sourcing Journal NY AG Asks Court to Dismiss NRF's Algorithmic Pricing Lawsuit Byte-Sized AI: Google's User Try-On Goes Live; Nike Beta Tests In-App Conversational AI Walmart Shifts Toward Super Agents In Play to Upgrade AI Strategy All of those issues are common as organizations work toward a more technology-focused future; some have already adopted simpler AI systems to help their work along, but many still lack maturity compared to those in their market. Six in 10 organizations use AI-integrated sales and operations planning systems, and more than half use predictive analytics. But markedly fewer organizations cited using quick-turnaround AI tools, like historical trend analysis (38 percent), automated booking (36 percent) and real-time demand sensing (35 percent) technology for those tasks. But as organizations across industries continue to invest in, and rely on, transportation management—particularly against the backdrop of a consumer demanding speed and supply chain costs fluctuating because of political changes—business' interest in transportation has started to see an uptick. While 78 percent of leaders said their organization sees transportation management as a necessity for success today, 86 percent said they know it will be a strong priority by 2030. But transportation transformation still has competing priorities; for instance, nearly seven in 10 leaders said sustainability remains a top priority for their organization. Simultaneously, organizations continue to struggle with insight into their transportation strategies; half of respondents said they have trouble preemptively rerouting shipments as needed, and about half of respondents are actively grappling with the best way to schedule dock and warehouse workers. While agentic AI may be positioned to help companies handle multiple priorities simultaneously, Bryan Smith, director, transportation management systems at Manhattan Associates, said the best way forward is to begin adopting helpful technology now. 'Eighty-seven percent of respondents anticipate that challenges in areas such as operational visibility, AI adoption and sustainability compliance will intensify, leaving their current transportation management systems struggling to keep pace. Failure to act now will expose organizations to rising costs, questions over long-term efficacy, and the risk of falling short of customer promises,' Smith said in a statement. The benefits of beginning early could incentivize companies to work toward at-scale adoption; more than 80 percent of leaders said they believe that technology advances could decrease the cost of freight by at least 5 percent by 2030. What's more, six in 10 companies indicated that stronger visibility and insight into transportation systems fosters stronger satisfaction among customers. Smith said whether or not companies choose to change alongside it, the market will continue to shift. Manhattan recognizes that many organizations have a ways to go on implementation, but that, in order to stay relevant with customers, companies need to consider prioritizing technology before they fall behind competitors. 'Modern transportation management demands organizations balance a range of competing priorities, and the research clearly illustrates many organizations are still unprepared to meet the challenges of evolving sustainability mandates, expectations around AI and the need for more visible, actionable data insights. Looking ahead to 2030, these demands will intensify, increasing the pressure on organizations to operate transportation operations in smarter more intuitive ways,' Smith said. Sign in to access your portfolio

Will Colgate's Innovation Drive Market Share Gains in 2025?
Will Colgate's Innovation Drive Market Share Gains in 2025?

Yahoo

timea day ago

  • Yahoo

Will Colgate's Innovation Drive Market Share Gains in 2025?

Colgate-Palmolive Company's CL innovation strategy for 2025 is tightly linked to its ambition to boost market share despite ongoing macroeconomic and category growth challenges. The company is leaning heavily into impactful innovation as a key building block for long-term growth, pairing it with data analytics, AI and omni-channel demand generation to accelerate its 2030 strategic plan. Across regions, Colgate is introducing core and premium products that not only strengthen brand health but also expand household penetration. For instance, in oral care, launches like Colgate Max White Clinical in the U.K. and Miracle Repair serum-infused toothpaste in China are being positioned at strategic price points to attract new users while reinforcing the company's premium innovation drive is not limited to oral care but extends across personal and home care as well as Hill's Pet Nutrition. In Europe, personal care offerings like Sanex Skin Therapy with Amino Acid Complex are addressing specific skin needs, while in Latin America, relaunches of the Protex bar soap and Suavitel Superior Care fabric conditioner are designed to strengthen category leadership. In Hill's Pet Nutrition segment, science-led innovations, such as ActivBiome+ Multi-Benefit in Science Diet and new Prescription Diet formulas, target niche therapeutic needs, appealing to veterinarians and pet owners seeking specialized nutrition approach is also rooted in premiumization, elevating its product mix to capture higher-margin sales while differentiating from value-tier competitors. Brands like Colgate Total, Luminous White, Sensitive Pro Relief and elmex are leading premium growth, particularly in markets like Latin America and Europe, where consumers continue to trade up for performance and specialized benefits. These premium launches are supported by robust advertising investment, which remains over 13% of net sales, reinforcing consumer awareness and trust at a time when competitive activity is ahead, the company's 2025 guidance acknowledges weaker category growth in the first half and potential headwinds from raw material inflation. However, management expects the combination of innovation-led volume gains, targeted pricing strategies and productivity-driven reinvestment to help offset these pressures. If executed effectively, Colgate's innovation pipeline, spanning oral care breakthroughs, personal and home care upgrades, and advanced pet nutrition, could serve as the primary catalyst for market share gains in 2025, solidifying its competitive positioning in both mature and emerging markets. CL's Competition in the Global Arena Church & Dwight Co. Inc. CHD, The Procter & Gamble Company PG and The Clorox Company CLX are key companies competing with Colgate in the global & Dwight is driving growth through innovation in both premium and value segments, highlighted by the Touchland acquisition in hand sanitizers, Batiste Light in dry shampoo, the Therabreath mouthwash expansion, and HERO's Mighty Patch Body launch. These initiatives, along with rising online sales, aim to boost household penetration and brand share despite category and retailer headwinds for Church & & Gamble is leveraging its 'irresistible superiority' strategy with launches like Tide evo plastic-free detergent, Swiffer Sweep and Mop Deluxe, Pampers premium upgrades, and SK-II's LXP line. Digital-first marketing and a strong track record in top new product rankings underpin Procter & Gamble's ability to grow categories and outpace is advancing innovation across its portfolio to support long-term growth, focusing on value-driven offerings and sustainability. In cleaning, the company is rolling out bleach innovations with enhanced disinfection claims while expanding Burt's Bees into new lip and skin care formats. Clorox is also innovating in Glad trash bags with stronger, more sustainable materials and launching Brita products with improved filtration performance. These initiatives aim to strengthen brand relevance, capture new users and reinforce category leadership amid a competitive and cost-sensitive environment. Want the latest recommendations from Zacks Investment Research? Today, you can download 7 Best Stocks for the Next 30 Days. Click to get this free report Procter & Gamble Company (The) (PG) : Free Stock Analysis Report Colgate-Palmolive Company (CL) : Free Stock Analysis Report The Clorox Company (CLX) : Free Stock Analysis Report Church & Dwight Co., Inc. (CHD) : Free Stock Analysis Report This article originally published on Zacks Investment Research ( Zacks Investment Research Error in retrieving data Sign in to access your portfolio Error in retrieving data Error in retrieving data Error in retrieving data Error in retrieving data

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into a world of global content with local flavor? Download Daily8 app today from your preferred app store and start exploring.
app-storeplay-store