lakehouse Archives - insideAI News https://insideainews.com/tag/lakehouse/ Illuminating AI's Frontiers: Your Go-To News Destination. Thu, 27 Jun 2024 18:37:51 +0000 en-US hourly 1 https://wordpress.org/?v=6.6.1 https://insideainews.com/wp-content/uploads/2024/06/iain-favicon.png lakehouse Archives - insideAI News https://insideainews.com/tag/lakehouse/ 32 32 136462205 SingleStore Unveils New Offerings to Unfreeze Data Lakehouses, Powering Intelligent Applications  https://insideainews.com/2024/06/27/singlestore-unveils-new-offerings-to-unfreeze-data-lakehouses-powering-intelligent-applications/ https://insideainews.com/2024/06/27/singlestore-unveils-new-offerings-to-unfreeze-data-lakehouses-powering-intelligent-applications/#respond Thu, 27 Jun 2024 09:59:00 +0000 https://insideainews.com/?p=35476 SingleStore, the real-time data platform for you to transact, analyze and contextualize, announced a bi-directional integration with Apache Iceberg that opens up a world of opportunities for building intelligent applications on data lakehouses. This integration addresses the critical challenge faced by enterprises where an estimated 90% of data remains “frozen” in lakehouses and is unusable for powering interactive applications, analytics or AI. ]]> https://insideainews.com/2024/06/27/singlestore-unveils-new-offerings-to-unfreeze-data-lakehouses-powering-intelligent-applications/feed/ 0 35476 The Solution to Data in Motion Is to Just Stop https://insideainews.com/2024/04/22/the-solution-to-data-in-motion-is-to-just-stop/ https://insideainews.com/2024/04/22/the-solution-to-data-in-motion-is-to-just-stop/#respond Mon, 22 Apr 2024 18:02:46 +0000 https://insidebigdata.com/?p=35185 In this contributed article, Sida Shen, product marketing manager, CelerData, discusses how data lakehouse architectures promise the combined strengths of data lakes and data warehouses, but one question arises: why do we still find the need to transfer data from these lakehouses to proprietary data warehouses? In this article, we'll explore how to maximize the efficiency of lakehouses, eliminate data in motion, and streamline data management processes.]]> https://insideainews.com/2024/04/22/the-solution-to-data-in-motion-is-to-just-stop/feed/ 0 35185 Research Highlights: Dremio Demonstrates Data Lakehouse Value with Math-Style Proof and Technical Clarity https://insideainews.com/2023/10/21/research-highlights-dremio-demonstrates-data-lakehouse-value-with-math-style-proof-and-technical-clarity/ https://insideainews.com/2023/10/21/research-highlights-dremio-demonstrates-data-lakehouse-value-with-math-style-proof-and-technical-clarity/#comments Sat, 21 Oct 2023 10:00:00 +0000 https://insidebigdata.com/?p=33702 Dremio, the easy and open data lakehouse, has published "The Data Lakehouse: Data Warehousing and More," a novel research paper now available on arXiv. The paper explores the data lakehouse model, offering modern insights for businesses looking to optimize their data utilization. The idea through this preprint publication is to gather feedback from the open source research and scientific community and make it available to the wider community of practitioners.]]> https://insideainews.com/2023/10/21/research-highlights-dremio-demonstrates-data-lakehouse-value-with-math-style-proof-and-technical-clarity/feed/ 1 33702 Cloudera Expands Open Data Lakehouse for Trusted Enterprise AI  https://insideainews.com/2023/06/27/cloudera-expands-open-data-lakehouse-for-trusted-enterprise-ai/ https://insideainews.com/2023/06/27/cloudera-expands-open-data-lakehouse-for-trusted-enterprise-ai/#respond Tue, 27 Jun 2023 13:00:00 +0000 https://insidebigdata.com/?p=32724 Cloudera, the hybrid data company, announced today an expansion of its Open Data Lakehouse offerings enabling customers to have a foundation for analytic and AI capabilities in their enterprises for all their data - in the cloud and now on-premises. Cloudera was an early proponent of Apache Iceberg, introducing support in its CDP-Public Cloud offering last year and recently rolling out support for Iceberg V2.  Today, Cloudera is announcing support for Apache Iceberg for CDP-Private Cloud, available now as a tech preview and with General Availability later this summer.  Cloudera delivers Iceberg everywhere customer data resides facilitating innovation anywhere.]]> https://insideainews.com/2023/06/27/cloudera-expands-open-data-lakehouse-for-trusted-enterprise-ai/feed/ 0 32724 Databricks Launches Simplified Real-Time Machine Learning for the Lakehouse https://insideainews.com/2023/03/07/databricks-launches-simplified-real-time-machine-learning-for-the-lakehouse/ https://insideainews.com/2023/03/07/databricks-launches-simplified-real-time-machine-learning-for-the-lakehouse/#respond Tue, 07 Mar 2023 15:00:00 +0000 https://insidebigdata.com/?p=31820 Databricks, the lakehouse company, announced the launch of Databricks Model Serving to provide simplified production machine learning (ML) natively within the Databricks Lakehouse Platform. Model Serving removes the complexity of building and maintaining complicated infrastructure for intelligent applications. Now, organizations can leverage the Databricks Lakehouse Platform to integrate real-time machine learning systems across their business, from personalized recommendations to customer service chatbots, without the need to configure and manage the underlying infrastructure.]]> https://insideainews.com/2023/03/07/databricks-launches-simplified-real-time-machine-learning-for-the-lakehouse/feed/ 0 31820 Video Highlights: Modernize your IBM Mainframe & Netezza With Databricks Lakehouse https://insideainews.com/2022/11/03/video-highlights-modernize-your-ibm-mainframe-netezza-with-databricks-lakehouse/ https://insideainews.com/2022/11/03/video-highlights-modernize-your-ibm-mainframe-netezza-with-databricks-lakehouse/#respond Thu, 03 Nov 2022 13:09:00 +0000 https://insidebigdata.com/?p=30797 In the video presentation below, learn from experts how to architect modern data pipelines to consolidate data from multiple IBM data sources into Databricks Lakehouse, using the state-of-the-art replication technique—Change Data Capture (CDC).]]> https://insideainews.com/2022/11/03/video-highlights-modernize-your-ibm-mainframe-netezza-with-databricks-lakehouse/feed/ 0 30797 Cloudera Continues Rapid Pace of Data Fabric and Data Lakehouse Innovation to Extend Data Management Leadership https://insideainews.com/2022/10/15/cloudera-continues-rapid-pace-of-data-fabric-and-data-lakehouse-innovation-to-extend-data-management-leadership/ https://insideainews.com/2022/10/15/cloudera-continues-rapid-pace-of-data-fabric-and-data-lakehouse-innovation-to-extend-data-management-leadership/#respond Sat, 15 Oct 2022 13:00:00 +0000 https://insidebigdata.com/?p=30633 Cloudera, the hybrid data company, announced new hybrid data capabilities that enable organizations to more efficiently move data, metadata, data workloads and data applications across clouds and on premises to optimize for performance, cost and security. Cloudera’s portable data services enable simple, low-risk data workload and data application movement for ultimate data lakehouse optionality.]]> https://insideainews.com/2022/10/15/cloudera-continues-rapid-pace-of-data-fabric-and-data-lakehouse-innovation-to-extend-data-management-leadership/feed/ 0 30633 Cloudera Launches All-in-One Data Lakehouse Cloud Service https://insideainews.com/2022/08/17/cloudera-launches-all-in-one-data-lakehouse-cloud-service/ https://insideainews.com/2022/08/17/cloudera-launches-all-in-one-data-lakehouse-cloud-service/#respond Wed, 17 Aug 2022 14:00:00 +0000 https://insidebigdata.com/?p=30114 Cloudera, the hybrid data company, announced the launch of Cloudera Data Platform (CDP) One, an all-in-one data lakehouse software as a service (SaaS) offering that enables fast and easy self-service analytics and exploratory data science on any type of data. A simple yet powerful cloud service, only CDP One has built-in enterprise security and machine learning (ML) that requires zero cloud, security or monitoring operations staff for lower TCO and reduced risk. ]]> https://insideainews.com/2022/08/17/cloudera-launches-all-in-one-data-lakehouse-cloud-service/feed/ 0 30114 Databricks Announces Major Contributions to Flagship Open Source Projects https://insideainews.com/2022/07/02/databricks-announces-major-contributions-to-flagship-open-source-projects/ https://insideainews.com/2022/07/02/databricks-announces-major-contributions-to-flagship-open-source-projects/#respond Sat, 02 Jul 2022 13:00:00 +0000 https://insidebigdata.com/?p=29715 Databricks announced that the company will contribute all features and enhancements it has made to Delta Lake to the Linux Foundation and open source all Delta Lake APIs as part of the Delta Lake 2.0 release. In addition, the company announced MLflow 2.0, which includes MLflow Pipelines, a new feature to accelerate and simplify ML model deployments. Finally, the company introduced Spark Connect, to enable the use of Spark on virtually any device, and Project Lightspeed, a next generation Spark Structured Streaming engine for data streaming on the lakehouse. ]]> https://insideainews.com/2022/07/02/databricks-announces-major-contributions-to-flagship-open-source-projects/feed/ 0 29715 Databricks Announces General Availability of Delta Live Tables https://insideainews.com/2022/04/06/databricks-announces-general-availability-of-delta-live-tables/ https://insideainews.com/2022/04/06/databricks-announces-general-availability-of-delta-live-tables/#respond Wed, 06 Apr 2022 13:00:00 +0000 https://insidebigdata.com/?p=28936 Databricks, the Data and AI company and pioneer of the data lakehouse paradigm, announced the general availability of Delta Live Tables (DLT), the first ETL framework to use a simple declarative approach to build reliable data pipelines and to automatically manage data infrastructure at scale. Turning SQL queries into production ETL pipelines often requires a lot of tedious, complicated operational work. By using modern software engineering practices to automate the most time consuming parts of data engineering, data engineers and analysts can concentrate on delivering data rather than on operating and maintaining pipelines.]]> https://insideainews.com/2022/04/06/databricks-announces-general-availability-of-delta-live-tables/feed/ 0 28936