
Hyperscaling SQL with Sam Lambert
Databases underpin almost every user experience on the web, but scaling a database is one of the most fundamental infrastructure challenges...

Databases and data engineering episodes of Software Engineering Daily

Java is one of the most widely used programming languages, and a key contributor to its success is VMware Tanzu’s Spring, the most common fr...

Apache Iceberg is an open source high-performance format for huge data tables. Iceberg enables the use of SQL tables for big data, while mak...

Starburst is a data lake analytics platform. It’s designed to help users work with structured data at scale, and is built on the open...

Building scalable software applications can be complex and typically requires dozens of different tools. The engineering often involves hand...

SurrealDB is the result of a long-time collaboration between brothers Tobie and Jaime Morgan Hitchcock. The project has modest origins and s...

Maritime logistics is the process of organizing the movement of goods across the ocean. Historically, this has been a challenging problem becau...

Data breaches at major companies are now so common that they hardly make the news. The Wikipedia page on data breaches lists over 350 betwee...

If you’re a sports fan and like to track sports statistics and results, you’ve probably heard of Sofascore. The website started...

Cloud-based software development platforms such as GitHub Codespaces continue to grow in popularity. These platforms are attractive to enter...

Knowledge graphs are an intuitive way to define relationships between objects, events, situations, and concepts. Their ability to encode thi...

Observability software helps teams to actively monitor and debug their systems, and these tools are increasingly vital in DevOps. However, i...

The importance of data teams is undeniable. Most companies today use data to drive decision-making on anything from software feature develop...

Today it’s estimated there are over 1 billion websites on the internet. Much of this content is optimized to be viewed by human eyes, not co...

There are hundreds of observability companies out there, and many ways to think about observability, such as application performance monitor...

It’s now clear that the adoption of AI will continue to increase, with nearly every industry working to rapidly incorporate it into their sy...

ScyllaDB is a fast and highly scalable NoSQL database designed to provide predictable performance at a massive cloud scale. It can handle mi...

Database caching is a fundamental challenge in database management and there are hundreds of techniques to satisfy different caching scenari...

Companies have high hopes for machine learning and AI to support real-time product offerings, prevent fraud and drive innovation. But there...

RudderStack is a warehouse-native customer data platform (CDP) that helps businesses collect, unify, and activate customer data from all the...

The state of data inside most companies is chaotic. It takes significant time and investment to tame this chaos. When you are a platform pro...

As companies depend more on data to improve digital products and make informed decisions, it’s crucial that the data they use be accur...

In this podcast episode, we take a look at the intricacies of low-code data pipelines with Raj Bains, the founder of Prophecy.io. Raj shares...

Chroma is an open source embedding database that is designed to make it easy to build large language model applications by making knowledge,...

Data Activation is the method of unlocking the knowledge stored within your data warehouse, and making it actionable by your business users...

A data catalog provides an index into the data sets and schemas of a company. Data teams are growing in size, and more companies than ever ha...

Streaming analytics refers to the process of analyzing real-time data that is generated continuously and rapidly from various sources, such...

Distributed databases are necessary for storing and managing data across multiple nodes in a network. They provide scalability, fault tolera...

DataSet is a log analytics platform provided by SentinelOne that helps DevOps, IT engineering, and security teams get answers from their da...

There are many types of early-stage funding available, from friends and family to seed to Series A. Some firms invest across a wide set of te...

The Presto/Trino project makes distributed querying easier across a variety of data sources. As the need for machine learning and other high...

Building and managing data-intensive applications has traditionally been costly and complex, and has placed an operational burden on develop...

Data analytics technology and tools have seen significant improvements in the past decade. But it can still take weeks to prototype, build...

Data is becoming a bank’s biggest asset. These complex enterprises have a huge opportunity ahead – to transform themselves to become a...

Ian Coe (CEO), Adam Kamor (Head of Engineering). Companies that gather data about their users have an ethical obligation and legal responsibility...

Couchbase is a distributed NoSQL cloud database. Since its creation, Couchbase has expanded into edge computing, application services, and m...

Streaming data platforms like Kafka, Pulsar, and Kinesis are now common in mainstream enterprise architectures, providing low-latency real-t...

Data-as-a-service is a company category that is not as common as API-as-a-service, software-as-a-service, or platform-as-a-servi...

Data labeling allows machine learning algorithms to find patterns among the data. There are a variety of data labeling platforms that enable...

Real-time analytics are difficult to achieve because large amounts of data must be integrated into a data set as that data streams in. As th...

Data loss can occur when large data sources such as Slack or Google Drive get leaked. In order to detect and avoid leaks, a data asset graph...

Data integration infrastructure is not easy to build. Moving large amounts of data from one place to another has historically required devel...

Modern organizations eventually face data governance challenges. Keeping track of where data came from, what systems update it, in what ways...

The solution many turn to for capturing their streaming data is InfluxDB. In this episode, I interview Brian Gilmore, Director of Product Ma...

Lior Gavish, James Densmore. Data infrastructure is a fast-moving sector of the software market. As the volume of data has increased, so too h...

Running a database company requires expertise in both technical and managerial skills. There are deeply technical engineering questions arou...

SingleStore is a multi-use, multi-model database designed for transactional and analytic workloads, as well as search and other domain speci...

DuckDB is a relational database management system with no external dependencies, with a simple system for deployment and integration into bu...

Customer data pipelines power the backend of many successful web platforms. In a customer data pipeline, data is collected from sources such...

The data lake architecture has become broadly adopted in a relatively short period of time. In a nutshell, that means data in its raw forma...