In Designing Cloud Data Platforms, Danil Zburivsky and Lynda Partner reveal a six-layer approach that increases flexibility and reduces costs. Discover patterns for ingesting data from a variety of sources, then learn to harness pre-built services provided by cloud vendors.
Summary
Centralized data warehouses, the long-time defacto standard for housing data for analytics, are rapidly giving way to multi-faceted cloud data platforms. Companies that embrace modern cloud data platforms benefit from an integrated view of their business using all of their data and can take advantage of advanced analytic practices to drive predictions and as yet unimagined data services. Designing Cloud Data Platforms is a hands-on guide to envisioning and designing a modern scalable data platform that takes full advantage of the flexibility of the cloud. As you read, you’ll learn the core components of a cloud data platform design, along with the role of key technologies like Spark and Kafka Streams. You’ll also explore setting up processes to manage cloud-based data, keep it secure, and using advanced analytic and BI tools to analyze it.
Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.
About the technology
Well-designed pipelines, storage systems, and APIs eliminate the complicated scaling and maintenance required with on-prem data centers. Once you learn the patterns for designing cloud data platforms, you’ll maximize performance no matter which cloud vendor you use.
About the book
In Designing Cloud Data Platforms, Danil Zburivsky and Lynda Partner reveal a six-layer approach that increases flexibility and reduces costs. Discover patterns for ingesting data from a variety of sources, then learn to harness pre-built services provided by cloud vendors.
What's inside
Best practices for structured and unstructured data sets
Cloud-ready machine learning tools
Metadata and real-time analytics
Defensive architecture, access, and security
About the reader
For data professionals familiar with the basics of cloud computing, and Hadoop or Spark.
About the author
Danil Zburivsky has over 10 years of experience designing and supporting large-scale data infrastructure for enterprises across the globe. Lynda Partner is the VP of Analytics-as-a-Service at Pythian, and has been on the business side of data for over 20 years.
Table of Contents
1 Introducing the data platform
2 Why a data platform and not just a data warehouse
3 Getting bigger and leveraging the Big 3: Amazon, Microsoft Azure, and Google
4 Getting data into the platform
5 Organizing and processing data
6 Real-time data processing and analytics
7 Metadata layer architecture
8 Schema management
9 Data access and security
10 Fueling business value with data platforms
eBook License
End-User Warranty and License Agreement
1. Grant of License
Manning has authorized the download by you of an unrestricted number of copies of the electronic book (ebook) in any of the available formats. Manning grants you a nonexclusive, nontransferable license to use the ebook according to the terms and conditions herein. This License Agreement permits you to install the ebook on any and all your devices for your personal use only.
2. Restrictions
You shall not: (1) share, resell, rent, assign, timeshare, distribute, or transfer all or part of the ebook or any rights granted hereunder to any other person; (2) duplicate the ebook, except for a single backup or archival copy; (3) remove any proprietary notices, labels, or marks from the ebook; (4) transfer or sublicense title to the ebook to any other party.
3. Intellectual Property Protection
The ebook is owned by Manning and is protected by United States and international copyright and other intellectual property laws. Manning reserves all rights in the ebook not expressly granted herein. This license and your right to use the ebook terminate automatically if you violate any part of this Agreement. In the event of termination, you must remove the original and any copies of the ebook from all your devices.
4. Source Code Supplementary Material
Any source code files provided as a supplement to the book are freely available to the public for download. Reuse of the code is permitted, in whole or in part, including the creation of derivative works, provided that you acknowledge that you are using it and identify the source: title, publisher and year.
5. Limited Warranty
Manning warrants that the ebook files, a copy of which you are authorized to download, are free from defects in the operational sense that they can be read by a PDF Reader or ePub reader, or other. EXCEPT FOR THIS EXPRESS LIMITED WARRANTY, MANNING MAKES AND YOU RECEIVE NO WARRANTIES, EXPRESS, IMPLIED, STATUTORY OR IN ANY COMMUNICATION WITH YOU, AND MANNING SPECIFICALLY DISCLAIMS ANY OTHER WARRANTY INCLUDING THE IMPLIED WARRANTY OF MERCHANTABILITY OR FITNESS OR A PARTICULAR PURPOSE. MANNING DOES NOT WARRANT THAT THE OPERATION OF THE EBOOK WILL BE UNINTERRUPTED OR ERROR FREE. If the ebook was purchased in the United States, the above exclusions may not apply to you as some states do not allow the exclusion of implied warranties. In addition to the above warranty rights, you may also have other rights that vary from state to state.
6. Limitation of Liability
IN NO EVENT WILL MANNING BE LIABLE FOR ANY DAMAGES, WHETHER ARISING FOR TORT OR CONTRACT, INCLUDING LOSS OF DATA, LOST PROFITS, OR OTHER SPECIAL, INCIDENTAL, CONSEQUENTIAL, OR INDIRECT DAMAGES ARISING OUT OF THE USE OR INABILITY TO USE THE EBOOK.
7. General
This Agreement constitutes the entire agreement between you and Manning and supersedes any prior agreement concerning the ebook. This Agreement is governed by the laws of the State of New York.