Whether to use Kubernetes or not is the question. This takes me back to the old Hadoop argument. People used to ask me to set up Hadoop clusters for them. As soon as I enquired how much data they had, it became immediately apparent that… Read More »Should I be using Kubernetes?
Welcome to 2024. You made it and this year is going to be big for Spark, Lakehouses, Stream processing engines and streaming data in general. I’ve had O’Reilly’s Stream Processing with Apache Spark, Streaming Systems and Stream Processing with Apache Flink on my shelves for… Read More »Data frameworks in 2024 – Which do you pick?
In today’s data-driven world, the ability to efficiently process, manage, and analyze data is not just a competitive edge; it’s a necessity. This is why we recently hosted a livestream (watch it here) to dive deep into Canonical’s Data Fabric platform, a solution that is… Read More »Unlocking the Power of Data with Canonical’s Data Fabric: Insights from Our Latest Livestream
DBT (Data Build Tool) is a powerful open-source tool designed specifically for data scientists like you. It enables you to transform, model, and analyze your data with ease, providing a structured and scalable approach. By utilizing DBT, you can achieve reliable, maintainable, and reproducible analytics.