Data Pipelines Pocket Reference: Moving and Processing Data for Analytics : Densmore, James by James Densmore


Data Pipelines Pocket Reference: Moving and Processing Data for Analytics : Densmore, James
Title : Data Pipelines Pocket Reference: Moving and Processing Data for Analytics : Densmore, James
Author :
Rating :
ISBN : -
Format Type : Kindle , Hardcover , Paperback , Audiobook & More
Number of Pages : 274

Data pipelines are the foundation for success in data analytics. Moving data from numerous diverse sources and transforming it to provide context is the difference between having data and actually gaining value from it. This pocket reference defines data pipelines and explains how they work in today's modern data stack.You'll learn common considerations and key decision points when implementing pipelines, such as batch versus streaming data ingestion and build versus buy. This book addresses the most common decisions made by data professionals and discusses foundational concepts that apply to open source frameworks, commercial products, and homegrown solutions.You'll learn:What a data pipeline is and how it worksHow data is moved and processed on modern data infrastructure, including cloud platformsCommon tools and products used by data engineers to build pipelinesHow pipelines support analytics and reporting needsConsiderations for pipeline maintenance, testing, and alerting


Data Pipelines Pocket Reference: Moving and Processing Data for Analytics : Densmore, James Reviews


  • Paloma Bareli Ribeiro

    O livro se destina a fazer uma revisão dos principais conceitos/práticas no desenvolvimento de uma boa pipeline de dados. É uma referência (não uma Bíblia e/ou step by step).Leitura bem leve e com exercícios práticos (não destinados para ambientes de Produção em alguns dos exemplos). Mas vale a compra.

  • Poonam Bhargava

    All the examples are relevant, easy to relate and understand. The books touches all topics with enough coverage to actually have it as a quick reference. Different frameworks for data ingestion(extraction and loading) and transformation in modern data infrastructure. Python scripts are provided and different approaches.

  • Simon Ringuet

    This is the best technical book I have read in a while. If you are working in data engineering, analysis or related fields you should buy this book right now, and if you are a team leader you should get the book for your team members.This is an invaluable reference for all kinds of common problems and tasks that you'll encounter daily that you will refer for years to come.The practical pocket size is fantastic if you want to put it in your bag and carry it around everywhere, but it's also an advantage over bigger books in that it needs to be clear and concise as it doesn't have any room for 'filler'.

  • Cliente goodreads

    The book is a quick introduction to data engineering. The focus is on the most important principles, rather than on specific technologies, that are always evolving.The technologies and services that are introduced are very well established in the data engineering community. Throughout the book, there are many code samples in Python and SQL, that will serve as a basis to build your own pipelines. To make the most out of this book, I would say that basic to intermediate Python and SQL knowledge is required.

  • Mediasuche

    Das Buch stellt eine gelungene Einführung in die Themen rund um Data Pipelines dar und verschafft dem Leser einen guten Überblick.Der Anspruch ist hierbei nicht, besonders in die Tiefe zu gehen und möglichst viele Details über die einzelnen Themen zu liefern, sondern einen möglichst breiten Überblick zu erschaffen und den Leser in die Lage zu versetzten, von da aus an selbst weiter in den einzelnen Themengebieten ein tiefgreifenderes Wissen zu erarbeiten.