In this article, Ashutosh Kumar discusses the emergence of modern data solutions that have led to the development of ELT and ETL with unique features and advantages. ELT is more popular due to its ability to handle large and unstructured datasets like in data lakes. Traditional ETL has evolved into cloud-based ETL which allows rapid batch processing, scalability, savings, and simplicity while maintaining security, governance, and compliance.
Why Do We Prefer ELT Rather than ETL in the Data Lake? What is the Difference between ETL & ELT
Zilliz Cloud Slashes Prices, Launches Free Tier to Democratize Vector Databases
Zilliz Cloud was already a fast vector database. Now it’s even more affordable. And you can’t get more affordable than free. The latest version of Zilliz Cloud, the managed service from the inventors of Milvus, includes a raft of new features along with additional pricing options for individuals and organizations. It makes vector databases accessible for projects of all sizes. Developers can now, regardless of budget, spin up generative AI applications while protecting against hallucination.
Kinetica Announces Conversational Query – ChatGPT Integration with Analytic Database
Kinetica announced an analytic database to integrate with ChatGPT, ushering in ‘conversational querying.’ Users can ask any question of their proprietary data, even complex ones that were not previously known, and receive an answer in seconds. The combination of ChatGPT’s front-end interface that converts natural language to Structured Query Language (SQL), and Kinetica’s analytic database purpose built for true ad-hoc querying at speed and scale, provides a more intuitive and interactive way of analyzing complex data sets.
DataStax Acquires Machine Learning Company Kaskada to Unlock Real-Time AI
DataStax, the real-time AI company, announced it has acquired Kaskada, a machine learning (ML) company that first solved managing, storing and accessing time-based data to train behavioral ML models and deliver the instant, actionable insights that fuel artificial intelligence (AI). Both DataStax and Kaskada have a track record of contributing to open source communities. Datastax will open source the core Kaskada technology initially, and it plans to offer a new machine learning cloud service later this year.
Snowflake vs. Databricks – Who has the Edge?
Fivetran recently unveiled the results of a new data warehouse benchmark report that revealed just how close the competition is among five of the most popular data warehouses. The report is the passion project of Fivetran CEO and data management expert George Fraser, who has a front row seat in the cloud data warehouse race. The report explains the cost vs. performance tradeoffs of each one of the warehouses, the ins and outs of the modern data stack, and provides a perspective on how it’s all going to shake out.
How to Ensure an Effective Data Pipeline Process
In this contributed article, Rajkumar Sen, Founder and CTO at Arcion, discusses how the business data in a modern enterprise is spread across various platforms and formats. Data could belong to an operational database, cloud warehouses, data lakes and lakehouses, or even external public sources. Data pipelines connecting this variety of sources need to establish some best practices so that the data consumers get high-quality data delivered to where the data apps are being built.
eBook: Unlock Complex and Streaming Data with Declarative Data Pipelines
Our friend, Ori Rafael, CEO of Upsolver and advocate for engineers everywhere, released his new book “Unlock Complex and Streaming Data with Declarative Data Pipelines.” Ori discusses why declarative pipelines are necessary for data-driven businesses and how they help with engineering productivity, and the ability for businesses to unlock more potential from their raw data. Data pipelines are essential to unleashing the potential of data and can successfully pull from multiple sources.
The Right Way to Get Started with PostgreSQL
In this contributed article, Igor Levshin, Director of Content of Postgres Professional, suggests that as with all database systems, anyone just starting to learn about PostgreSQL can benefit from a clear, incremental approach to developing a strong skillset. This article outlines such an approach, which is also developed in far more detail – including step- by-step instructions and code samples – in “Postgres. The First Experience,” a free, downloadable book by Pavel Luzanov, Egor Rogov, and Igor Levshin.
Instaclustr Including PostgreSQL in Managed Data Platform – Now in Public Preview
Instaclustr, delivering reliability at scale through its fully managed platform for open source data technologies, announced the addition of PostgreSQL to its Managed Platform, now available in public preview for Instaclustr customers. Managed PostgreSQL offers complete database management and optimization, along with comprehensive support and monitoring backed by Instaclustr’s team of PostgreSQL experts.
SingleStore Research Highlights Spike in Data Demands Amid COVID-19 Pandemic
Many aspects of life and work stopped or slowed down significantly during the pandemic. But new research from SingleStore, the unified database for fast analytics, indicates that data requirements in the age of COVID-19 have been greater than ever. This research is based on a 500-person survey of IT professionals that Propeller Insights conducted in January 2021 on behalf of SingleStore.