
Join The Data Movement Movement
Almost all data moves. Although there is a formalized classification of 'data at rest' to denote those information resources that are not on the move between applications, databases, analytics engines and other service components, most data at some point in its life is moved around and some of it is transmitted in real-time.
But data movement is now becoming a technology classification in and of itself. This is in no small part down to the rise of artificial intelligence and the need to feed AI models with information spanning large and small language models, alongside regular systems data that AI needs to operate, interconnect and apply the requisite levels of governance and compliance to itself.
Logically then, moving data is risky. This is because data in transport doesn't necessarily break as such (although connection between different components of a given software system can lead to corruptions), we're more concerned with making sure data on the move doesn't end up in the wrong place and isn't sent to machine entities that don't have access policy rights.
Airbyte is a company that describes itself as an open source data movement platform specialist. Its technology works to move data at scale for AI and analytics workloads, while ensuring governance so that organizations spend less time managing data pipelines while unlocking value from data.
'We make it possible for organizations to protect their data, ensuring that it doesn't accidentally become accessible outside the organization by consumers of AI models,' said Michel Tricot, co-founder and CEO, Airbyte. '[The technology] we're delivering helps eliminate data silos and improves data accessibility, while still ensuring security and compliance to maintain data sovereignty without adding operational overhead.'
The company is now updating its products designed to provide organizations with additional support for data movement while retaining sovereignty over their own first-party data with enhanced security, speed and enhanced resource management. This includes support for unstructured data and portable data lake formats, which are repositories typically used for the mass storage of raw data streams (think of them more like a holding pen than a dumping ground) before they become managed and structured for onward use.
In technology terms this includes support for the Apache Iceberg open standard for moving data into modern lakehouse architectures, which are the backbone for AI workloads with large language models as well as modern analytics. The company is also offering file transfer support for Google Drive, SharePoint and OneDrive for the movement of unstructured data such as PDF, video and image files (along with their metadata and permissions), making all of this data accessible for AI.
Tricot and team say that they are making moving data easy and affordable across nearly any source and destination, ensuring enterprises have accurate, timely data for analysis and decision-making. With over 900 contributors and a community of more than 230,000 members, Airbyte supports a large data engineering community that works with an industry-wide open data movement platform.
A new Mappers feature enables users to perform 'lightweight data transformations' (comparatively simple changes to data resources such as deduplicating, formatting, parsing or restructuring to achieve data consistency when different information resources are dovetailed or integrated, or simply for good housekeeping) directly within the Airbyte interface.
This transformation can also include hashing (changing all data fields to a fixed length, typically a shorter one), encrypting, renaming fields and filtering rows to help organizations maintain compliance with data privacy regulations like GDPR and HIPAA.
Airbyte is also providing an enterprise connector bundle as a complement to its Airbyte Cloud Teams and Self-Managed Enterprise brands. The bundle includes connectors for NetSuite, Oracle database with Change Data Capture, SAP HANA, ServiceNow and Workday.
'This bundle of connectors streamlines how the world's largest organizations access their most valuable financial, operational and human resource data. The Airbyte Enterprise products ensure that organizations can easily and securely extract critical data from complex and sensitive sources with governance controls for data privacy and compliance,' said Tricot.
Looking ahead, we may see the data movement movement extend and finesse itself in the same way that house removal businesses do. We can hire a major removal firm for a whole house move, or we can hire a 'man with a van' for smaller tasks. We can hire a down and dirty cheap low-end service to lump concrete or broken down wall materials around, or we can hire bespoke white glove services that promise to transport our porcelain tea sets all in one piece and make sure they get to their intended destination. Either way, whether its data or homewares, make sure you lay down a dust sheet.

Try Our AI Features
Explore what Daily8 AI can do for you:
Comments
No comments yet...
Related Articles
Yahoo
7 hours ago
- Yahoo
Google (GOOGL) Resolves Global Outage That Disrupted Gmail, Cloud, and Partner Apps
Google (GOOGL, Financials) said Thursday it resolved a brief but widespread outage that disrupted several of its services including Gmail, Google Drive, Meet, and Cloud and cascaded into apps like Spotify, Snapchat, and Discord that rely on its infrastructure. Warning! GuruFocus has detected 3 Warning Signs with AMD. The disruption, which began around 1:50 p.m. ET, sparked tens of thousands of user reports on Spotify logged over 46,000 outage reports at its peak; Discord saw nearly 11,000. By evening, most services had recovered. Google said it will release a detailed analysis after completing its internal investigation. A Google Cloud dashboard update confirmed that while most systems were back online, a few services may still experience residual effects. Though brief, the outage exposed the growing dependency of third-party apps on Google's cloud infrastructure raising fresh questions about redundancy and resilience in today's digital stack. This article first appeared on GuruFocus.


Tom's Guide
2 days ago
- Tom's Guide
Gemini can turn text into audio overviews — here's how to do it
You may already be familiar with NotebookLM, Google's Gemini-powered research companion. If you're new to it, it's worth exploring its standout feature called Audio Overview, which takes uploaded information and turns it into a podcast hosted by two AI presenters. This feature is, in many ways, one of the main reasons for using NotebookLM, allowing you to more easily digest even the most complex of information. But the feature has been airing elsewhere. Indeed, you'll now find it in Google's AI assistant Gemini either on Android or iOS. It works in the exact same way but it plays the audio in your browser. Let's check it out. Launch the Gemini app and, in the Ask Gemini box, tap the Plus button. Next, tap Files and select a document you'd like Gemini to work with. You will be able to look through files in Google Drive and lots of file types are supported including DOC, DOCX PDF, RFT and TXT. Once the file has been uploaded, tap Generate Audio Overview. You will need to wait a few minutes while your file is being processed but eventually you will get a result. You don't actually have to stay in the chat — you will be notified when it's ready. You can now tap the Audio Overview in order to listen to it — you may need to tap play. You can find the Audio Overview at any time if you select the Chats and Gems icon in the top-left of the screen. You can share the Audio Overview from this playback screen. Just tap the three-dot icon in the top-right of the screen and tap the Download icon (a downward-pointing icon at the top of the menu). You can also tap Share and select a method such as Messages, email or social media. And there you go. You now know how to generate audio overviews in Gemini, but there's so much else you can do. You can learn how to use Google Gemini to summarize a YouTube video or figure out how to use Gemini AI to create the perfect workout music playlist. It's even possible to discover how to find the best haircut for your face shape. Get instant access to breaking news, the hottest reviews, great deals and helpful tips.


Android Authority
3 days ago
- Android Authority
Gemini in Google Drive gets right to the point with automatic PDF summaries
Andy Walker / Android Authority TL;DR Gemini can now read PDFs in Google Drive and create a summary card for quick viewing. The feature comes with buttons users can click to make Gemini take action on the PDF, such as creating a draft proposal. The update is rolling out now to all Workspace users. Google Drive has been getting a lot of Gemini love recently. Google added the ability for Gemini to browse your files and even watch videos for you. Now, Google is rolling out a new feature that gives users instant summaries of their PDF files. Google introduced PDF summary cards, which are a new AI-driven feature that proactively summarizes PDF content when a file is opened in Google Drive. The summaries themselves include clickable actions like 'Draft a sample proposal' or 'List interview questions based on this resume,' two examples Google gave in their announcement. These actions launch Gemini in a side panel so users can get started on tasks without having to leave the document. This isn't the first time Google has tried to integrate Gemini with PDFs. We've had simple overviews for a while now, but the actionable AI suggestions, and the new card layout, are brand new. Users can double-click any PDF to see the summary card. It appears within the overlay preview window instead of opening in a separate browser tab. Google has been sticking Gemini into everything, from summarizing reports in Docs to drafting replies and email templates in Gmail. It can organize tasks in Calendar and analyze a spreadsheet in Sheets. Those are all great, but PDF summaries in Drive could be particularly useful for everyone's productivity. PDF summaries in Drive could be particularly useful for everyone's productivity. You don't need to use the new AI summaries of your Drive PDFs. You can update your Drive settings to continue opening PDFs in a separate browser if that is what you prefer. The feature supports over 20 languages at launch. It is available to Google Workspace customers on Business Standard, Enterprise Standard and Plus, and users with the Gemini Education plan. It is also available for anyone with the old AI Pro or Ultra add-ons.