Introduction To Data Platforms

Download Introduction To Data Platforms PDF/ePub or read online books in Mobi eBooks. Click Download or Read Online button to get Introduction To Data Platforms book now. This website allows unlimited access to, at the time of writing, more than 1.5 million titles, including hundreds of thousands of titles in various foreign languages.
Introduction to Data Platforms

Author: Anthony David Giordano
language: en
Publisher: Fulton Books, Inc.
Release Date: 2022-11-03
Digital, cloud, and artificial intelligence (AI) have disrupted how we use data. This disruption has changed the way we need to provision, curate, and publish data for the multiple use cases in today's technology-driven environment. This text will cover how to design, develop, and evolve a data platform for all the uses of enterprise data needed in today's digital organization. This book focuses on explaining what a data platform is, what value it provides, how is it engineered, and how to deploy a data platform and support organization. In this context, Introduction to Data Platforms reviews the current requirements for data in the digital age and quantifies the use cases; discusses the evolution of data over the past twenty years, which is a core driver of the modern data platform; defines what a data platform is and defines the architectural components and layers of a data platform; provides the architectural layers or capabilities of a data platform; reviews cloud- and commercial-software vendors that populate the data-platform space; provides a step-by-step approach to engineering, deploying, supporting, and evolving a data-platform environment; provides a step-by-step approach to migrating legacy data warehouses, data marts, and data lakes/sandboxes to a data platform; and reviews organizational structures for managing data platform environments.
The Enterprise Big Data Lake

The data lake is a daring new approach for harnessing the power of big data technology and providing convenient self-service capabilities. But is it right for your company? This book is based on discussions with practitioners and executives from more than a hundred organizations, ranging from data-driven companies such as Google, LinkedIn, and Facebook, to governments and traditional corporate enterprises. You’ll learn what a data lake is, why enterprises need one, and how to build one successfully with the best practices in this book. Alex Gorelik, CTO and founder of Waterline Data, explains why old systems and processes can no longer support data needs in the enterprise. Then, in a collection of essays about data lake implementation, you’ll examine data lake initiatives, analytic projects, experiences, and best practices from data experts working in various industries. Get a succinct introduction to data warehousing, big data, and data science Learn various paths enterprises take to build a data lake Explore how to build a self-service model and best practices for providing analysts access to the data Use different methods for architecting your data lake Discover ways to implement a data lake from experts in different industries
Big Data Technologies and Applications

The objective of this book is to introduce the basic concepts of big data computing and then to describe the total solution of big data problems using HPCC, an open-source computing platform. The book comprises 15 chapters broken into three parts. The first part, Big Data Technologies, includes introductions to big data concepts and techniques; big data analytics; and visualization and learning techniques. The second part, LexisNexis Risk Solution to Big Data, focuses on specific technologies and techniques developed at LexisNexis to solve critical problems that use big data analytics. It covers the open source High Performance Computing Cluster (HPCC Systems®) platform and its architecture, as well as parallel data languages ECL and KEL, developed to effectively solve big data problems. The third part, Big Data Applications, describes various data intensive applications solved on HPCC Systems. It includes applications such as cyber security, social network analytics including fraud, Ebola spread modeling using big data analytics, unsupervised learning, and image classification. The book is intended for a wide variety of people including researchers, scientists, programmers, engineers, designers, developers, educators, and students. This book can also be beneficial for business managers, entrepreneurs, and investors.