Ashish Thusoo

Creating a Data-Driven Enterprise in Media

2018-04-15 · O'Reilly Data Science Books O'Reilly Amazon

book

with Ashish Thusoo , Joydeep Sen Sarma

data data-science Analytics Big Data Cloud Computing DataOps

The data-driven revolution is finally hitting the media and entertainment industry. For decades, broadcast television and print media relied on traditional delivery channels for solvency and growth, but those channels fragmented as cable, streaming, and digital devices stole the show. In this ebook, you’ll learn about the trends, challenges, and opportunities facing players in this industry as they tackle big data, advanced analytics, and DataOps. You’ll explore best practices and lessons learned from three real-world media companies—Sling TV, Turner Broadcasting, and Comcast—as they proceed on their data-driven journeys. Along the way, authors Ashish Thusoo and Joydeep Sen Sarma explain how DataOps breaks down silos and connects everyone who handles data, including engineers, data scientists, analysts, and business users. Big-data-as-a-service provider Qubole provides a five-step maturity model that outlines the phases that a company typically goes through when it first encounters big data. Case studies include: Sling TV: this live streaming content platform delivers live TV and on-demand entertainment instantly to a variety of smart televisions, tablets, game consoles, computers, smartphones, and streaming devices Turner Broadcasting System: this Time Warner division recently created the Turner Data Cloud to support direct-to-consumer services, including FilmStruck, Boom (for kids), and NBA League Pass Comcast: the largest broadcasting and cable TV company is building a single integrated big data platform to deliver internet, TV, and voice to more than 28 million customers

Data as a Feature

2018-03-15 · O'Reilly Data Science Books O'Reilly Amazon

book

with Matt LeMay , Ashish Thusoo

data data-science

Business applications are evolving and user expectations for quality, easy-to-use software is at an all-time high. The consumerization of applications is making the role of product managers more difficult than ever. How do you build products or services that meet demands for both power and simplicity? Companies are now gaining competitive advantage by providing intuitive application experiences that help users achieve goals. The best applications—the ones that stick—are those that empower users to realize the full value of their data. In this book, we explore how treating data as a feature can help product managers create indisposable applications that help users solve their most critical goals. Understand your users’ goals, the data they’ll need to achieve them, where the data comes from, and how to visualize data effectively Use personas to help you keep users in mind when making critical development and design decisions Provide an interwoven data experience by immersing reports, dashboards, and visualizations into your applications Make your data “over-the-counter” so that you and your users can accurately and easily interpret it Learn how to manage your data roadmap and handle requests for additional features

Creating a Data-Driven Enterprise with DataOps

2017-03-15 · O'Reilly Data Science Books O'Reilly Amazon

book

with Ashish Thusoo , Joydeep Sen Sarma

data data-science DataOps Big Data Cloud Computing DevOps

Many companies are busy collecting massive amounts of data, but few are taking advantage of this treasure horde to build a truly data insights-driven organization. To do so, the data team must democratize both data and the insights in a way that provides real-time access to all employees in the organization. This report explores DataOps, the process, culture, tools, and people required to scale big data pervasively across the enterprise. Just as DevOps has enabled organizations to improve coordination between developers and the operations team, DataOps closely connects everyone who handles data, including engineers, data scientists, analysts, and business users. Democratizing data with this approach requires removing barriers typical of siloed data, teams, and systems. In this report, Apache Hive creators Ashish Thusoo and Joydeep Sen Sarma examine the characteristics of a data-driven organization that supports a self-service model. Explore related topics such as data lakes, metadata, cloud architecture, and data-infrastructure-as-a-service Examine conclusions from a survey of more than 400 senior executives whose companies are in various stages of data maturity Learn how data pioneers at Facebook, Uber, LinkedIn, Twitter, and eBay created data-driven cultures and self-service data infrastructures for their organizations

The Big Data Transformation

2016-11-15 · O'Reilly Data Engineering Books O'Reilly Amazon

book

data data-engineering Analytics Big Data Data Analytics DWH

Business executives today are well aware of the power of data, especially for gaining actionable insight into products and services. But how do you jump into the big data analytics game without spending millions on data warehouse solutions you don’t need? This 40-page report focuses on massively parallel processing (MPP) analytical databases that enable you to run queries and dashboards on a variety of business metrics at extreme speed and Exabyte scale. Because they leverage the full computational power of a cluster, MPP analytical databases can analyze massive volumes of data—both structured and semi-structured—at unprecedented speeds. This report presents five real-world case studies from Etsy, Cerner Corporation, Criteo and other global enterprises to focus on one big data analytics platform in particular, HPE Vertica. You’ll discover: How one prominent data storage company convinced both business and tech stakeholders to adopt an MPP analytical database Why performance marketing technology company Criteo used a Center of Excellence (CoE) model to ensure the success of its big data analytics endeavors How YPSM uses Vertica to speed up its Hadoop-based data processing environment Why Cerner adopted an analytical database to scale its highly successful health information technology platform How Etsy drives success with the company’s big data initiative by avoiding common technical and organizational mistakes

Architecting Data Lakes

2016-04-15 · O'Reilly Data Engineering Books O'Reilly Amazon

book

with Ashish Thusoo , Ben Sharma

data data-engineering storage-repositories data-lake Data Lake Data Management

Many organizations use Hadoop-driven data lakes as an adjunct staging area for their enterprise data warehouses (EDW). But for those companies ready to take the plunge, a data lake is far more useful as a one-stop-shop for extracting insights from their vast collection of data. With this eBook, you’ll learn best practices for building, maintaining, and deriving value from a Hadoop data lake in production environments. Authors Alice LaPlante and Ben Sharma explain how a data lake will enable your organization to manage an increasing volume of datasets—from blog postings and product reviews to streaming data—and to discover important relationships between them. Whether you want to control administrative costs in healthcare or reduce risk in financial services, this ebook addresses the architectural considerations and required capabilities you need to build your own data lake. With this report, you’ll learn: The key attributes of a data lake, including its ability to store information in native formats for later processing Why implementing data management and governance in your data lake is crucial How to address various challenges for building and managing a data lake Self-service options that enable different users to access the data lake without help from IT Emerging trends that will shape the future of data lakes

talk-data.com

Frequent Collaborators

Filter by Event / Source

Creating a Data-Driven Enterprise in Media

Data as a Feature

Creating a Data-Driven Enterprise with DataOps

The Big Data Transformation

Architecting Data Lakes