data

Practical Anonymity

2013-07-19 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Peter Loshin

Cyber Security data-engineering data-security-privacy data security & privacy

For those with legitimate reason to use the Internet anonymously--diplomats, military and other government agencies, journalists, political activists, IT professionals, law enforcement personnel, political refugees and others--anonymous networking provides an invaluable tool, and many good reasons that anonymity can serve a very important purpose. Anonymous use of the Internet is made difficult by the many websites that know everything about us, by the cookies and ad networks, IP-logging ISPs, even nosy officials may get involved. It is no longer possible to turn off browser cookies to be left alone in your online life. Practical Anonymity: Hiding in Plain Sight Online shows you how to use the most effective and widely-used anonymity tools--the ones that protect diplomats, military and other government agencies to become invisible online. This practical guide skips the theoretical and technical details and focuses on getting from zero to anonymous as fast as possible. For many, using any of the open-source, peer-reviewed tools for connecting to the Internet via an anonymous network may be (or seem to be) too difficult because most of the information about these tools is burdened with discussions of how they work and how to maximize security. Even tech-savvy users may find the burden too great--but actually using the tools can be pretty simple. The primary market for this book consists of IT professionals who need/want tools for anonymity to test/work around corporate firewalls and router filtering as well as provide anonymity tools to their customers. Simple, step-by-step instructions for configuring and using anonymous networking software Simple, step-by-step instructions for configuring and using anonymous networking software Use of open source, time-proven and peer-reviewed tools for anonymity Plain-language discussion of actual threats and concrete suggestions for appropriate responses Easy-to-follow tips for safer computing Simple, step-by-step instructions for configuring and using anonymous networking software Use of open source, time-proven and peer-reviewed tools for anonymity Plain-language discussion of actual threats, and concrete suggestions for appropriate responses Easy to follow tips for safer computing

Segmentation and Lifetime Value Models Using SAS

2013-07-18 · O'Reilly Data Science Books O'Reilly Amazon

book

by Edward C. Malthouse

Analytics CRM Marketing SAS SQL analytics-platforms data-science

Help your organization determine the value of its customer relationships with Segmentation and Lifetime Value Models Using SAS. This book contains a wealth of information that will help you perform analyses to identify your customers and make informed marketing investments. It answers core questions on customer relationship management (CRM), provides an overall framework for thinking about CRM, and offers real-world examples across a variety of industries.

Edward C. Malthouse introduces you to a number of useful models, ranging from simple to more complicated examples, and discusses their applications. You'll learn about segmentation models for identifying groups of customers and about lifetime value models for estimating the future value of the segments. You'll learn how to prepare data and estimate models using Base SAS, SAS/STAT, SAS/IML, and SQL.

Marketing analysts, CRM analysts, database managers, and anyone looking to address the challenges of allocating marketing resources to different customer groups will benefit from the concepts and exercises in this book. Analysts will learn how to approach unique business problems. Managers will gain a sense of what's possible and what to ask of their analytics departments.

This book is part of the SAS Press program.

Data Model Patterns

2013-07-17 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by David C. Hay , Richard Barker

Data Modelling data-engineering data-models

This is the digital version of the printed book (Copyright © 1996). Learning the basics of a modeling technique is not the same as learning how to use and apply it. To develop a data model of an organization is to gain insights into its nature that do not come easily. Indeed, analysts are often expected to understand subtleties of an organization's structure that may have evaded people who have worked there for years. Here's help for those analysts who have learned the basics of data modeling (or "entity/relationship modeling") but who need to obtain the insights required to prepare a good model of a real business. Structures common to many types of business are analyzed in areas such as accounting, material requirements planning, process manufacturing, contracts, laboratories, and documents. In each chapter, high-level data models are drawn from the following business areas: The Enterprise and Its World The Things of the Enterprise Procedures and Activities Contracts Accounting The Laboratory Material Requirements Planning Process Manufacturing Documents Lower-Level Conventions

Pro Oracle Database 12c Administration, Second Edition

2013-07-17 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Darl Kuhn

Oracle Cyber Security data-engineering oracle-database-solutions

Pro Oracle Database 12c Administration is a book focused on results. Author Darl Kuhn draws from a well of experience over a decade deep to lay out real-world techniques that lead to success as an Oracle Database administrator. He gives clear explanations on how to perform critical tasks. He weaves in theory where necessary without bogging you down in unneeded detail. He is not afraid to take a stand on how things should be done. He won't leave you adrift in a sea of choices, showing you three ways to do something and then walking away. Database administration isn't about passing a certified exam, or about pointing-and-clicking your way through a crisis. Database administration is about applying the right solution at the right time, about avoiding risk, about making robust choices that get you home each night in time for dinner with your family. If you have "buck stops here" responsibility for an Oracle database, then Pro Oracle Database 12c Administration is the book you need to help elevate yourself to the level of Professional Oracle Database Administrator. Covers multi-tenant container and pluggable database implementation and management Condenses and organizes the core job of a database administrator into one volume. Takes a results-oriented approach to getting things done. Lays a foundation upon which to build a senior level of expertise What you'll learn Create a stable environment consistent across all databases that you manage Manage pluggable and multi-tenant databases Take care of job #1: backing up, and then recovering when needed Manage users and objects, and the security between them Do battle with "large"—large databases and large objects Move and distribute data using Data Pump, materialized views, external tables Automate critical jobs and tackle database troubleshooting problems Who this book is for Pro Oracle Database 12c Administration is aimed at new database administrators who aspire to senior positions in which employers and customers trust you to work independently, and with a "buck stops here" attitude.

RMAN Recipes for Oracle Database 12c: A Problem-Solution Approach, Second Edition

2013-07-17 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Arup Nanda , Darl Kuhn , Sam Alapati

Oracle data-engineering oracle-database-solutions

RMAN Recipes for Oracle Database 12c is an example-driven approach to the Oracle database administrator's #1 job responsibility: Be able to recover the database. Of all the things you are responsible for as database administrator, nothing is more important than the data itself. Like it or not, the fearsome responsibility of protecting your organization's most critical data falls squarely upon your shoulders: Lose that data and your company could fail. Lose that data and you could be out of a job. Oracle's flagship database product fortunately implements a wide-ranging feature set to aid you in the all-important task of safeguarding against data loss. Recovery Manager, or RMAN, is at the heart of that feature set, and is the tool most-often used to initiate database backup and recovery operations. In this book, well-known authors and database experts Darl Kuhn, Sam Alapati, and Arup Nanda have created a set of examples encompassing the gamut of backup and recovery tasks that you might need to perform. Sometimes, especially when the heat is on, a good example is what you need to get started towards a solution. RMAN Recipes for Oracle Database 12c delivers. It'll be the book you reach for when that dreaded call comes in at 3:00am some dreary morning. It'll be the book that lets you sleep at night knowing that no matter what transpires, that you've done your job well and can recover from any outage. RMAN Recipes for Oracle Database 12c gets right to the point with quick and easy-to-read, step-by-step solutions that can help you backup and recover your data with confidence. What you'll learn Reliably back up and recover your database using Oracle's Recovery Manager Let Oracle Database manage your backup files via the Fast Recovery Area Automate backup and recovery tasks by writing scripts Troubleshoot RMAN problems and optimize RMAN performance Recover from the loss of a control file, loss of an online redo log, and from other unusual situations Who this book is for RMAN Recipes for Oracle Database 12c is aimed squarely at Oracle database administrators responsible for database backup and recovery operations.

Apache Flume: Distributed Log Collection for Hadoop

2013-07-16 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Steven Hoffman

Hadoop apache-flume data-engineering log-data

Apache Flume: Distributed Log Collection for Hadoop is a focused guide for users looking to efficiently collect and transport log data into systems like Hadoop using Apache Flume. Its step-by-step approach covers the installation, configuration, and customization of Flume to optimize your data ingestion workflows. What this Book will help me do Effectively install and set up Apache Flume for your data ingestion processes. Understand Flume's architecture and capabilities, including sources, channels, and sinks. Learn to configure reliable data flow paths using failover and load-balancing techniques. Implement data routing and transformations during data flow using Flume. Optimize and monitor your Flume operations to enhance reliability and performance. Author(s) The authors of this book are experienced software engineers and data administrators with deep knowledge and practical expertise in implementing distributed log collection systems. Their teaching approach combines clear explanation with actionable examples to give you a hands-on learning experience. Who is it for? This book is ideal for software engineers, data engineers, and system administrators involved in handling and transporting datasets, especially those with a focus on Hadoop. If you are seeking to understand or optimize Apache Flume for your data processing pipeline, this book will guide you from beginner-friendly setup to advanced customization, helping to enhance your workflows.

Microsoft Access 2013 Inside Out

2013-07-15 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Jeff Conrad

Microsoft data-engineering database-management-tools microsoft-access

Conquer Microsoft Access 2013—from the inside out! You’re beyond the basics, so dive right into Access 2013—and use your skills to create sophisticated database apps! This supremely organized reference packs hundreds of timesaving solutions, troubleshooting tips, and workarounds. It’s all muscle and no fluff. Discover how the experts tackle Access 2013—and challenge yourself to new levels of mastery. Build an Access Services web app with Microsoft SharePoint Server Automate your Access web app with data macros Create tables in your Access web app using built-in templates Aggregate and display your web app data using totals queries Use the Autocomplete control to quickly search for related data Create a Summary view to consolidate and group information Display related data on your views with the Related Items control Package your web app for use by others in your organization Plus—download chapters on building desktop databases For Intermediate and Advanced Users and Database Designers

IBM Flex System and PureFlex System Network Implementation with Juniper Networks

2013-07-12 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Jon Tate , Tiago Nunes dos Santos , Gaston Sancassano Rodriguez , William King , Jure Arzensek , David Cain

Cloud Computing IBM Fabric data-engineering

To meet today's complex and ever-changing business demands, you need a solid foundation of server, storage, networking and software resources that is simple to deploy and can quickly and automatically adapt to changing conditions. You also need access to, and the ability to take advantage of, broad expertise and proven best practices in systems management, applications, hardware maintenance and more. IBM® PureFlex™ System, which is a part of the IBM PureSystems™ family of expert integrated systems, combines advanced IBM hardware and software along with patterns of expertise and integrates them into three optimized configurations that are simple to acquire and deploy so you can achieve faster time to value. If you want a pre-configured, pre-integrated infrastructure with integrated management and cloud capabilities, factory tuned from IBM with x86 and Power hybrid solution, IBM PureFlex System is the answer. In this IBM Redbooks® publication, we use EX4500 core switches to demonstrate interoperability with the System Networking switches (RackSwitch™ G8264 top of rack switch and the Flex system fabric EN4093 10Gb scalable switch). We also describe a redundant environment using QFX3500 switches running IBM Virtual-Link Aggregation Group (MC-LAG/vLAG) and Juniper Multi- Chassis-Link Aggregation Group.

IBM XIV Storage System Gen3: Architecture, Implementation, and Usage

2013-07-12 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Hank Sautter , Guenter Rebmann , Jim Sedgwick , Bertrand Dufrasne , Christian Burns

Cloud Computing IBM data-engineering

This IBM® Redbooks® publication describes the concepts, architecture, and implementation of the IBM XIV® Storage System. The XIV Storage System is a scalable enterprise storage system that is based on a grid array of hardware components. It can attach to both Fibre Channel Protocol (FCP) and IP network Small Computer System Interface (iSCSI) capable hosts. This system is a good fit for clients who want to be able to grow capacity without managing multiple tiers of storage. The XIV Storage System is suited for mixed or random access workloads, including online transaction processing, video streamings, images, email, and emerging workload areas, such as Web 2.0 and storage cloud. The focus of this edition is on the XIV Gen3 hardware Release 3.2, running Version 11.2 of the XIV system software. With this version, XIV Storage System offers up to five times the iSCSI throughput with new 10 GbE ports, a performance boost with new CPUs, and enhanced caching with optional solid-state drives (SSDs). The IBM XIV software Version 11.2 also offers support for Windows Server 2012, including space reclamation. And, the software enables drive rebuild times as fast as 26 minutes for a fully utilized 2 TB hard disk drive under heavy load. In the first few chapters of this book, we describe many of the unique and powerful concepts that form the basis of the XIV Storage System logical and physical architecture. We explain how the system is designed to eliminate direct dependencies between the hardware elements and the software that governs the system. In subsequent chapters, we explain the planning and preparation tasks that are required to deploy the system in your environment. A step-by-step procedure is presented that describes how to configure and administer the system. Illustrations are provided about how to perform those tasks by using the intuitive, yet powerful XIV Storage Manager GUI or the XIV command-line interface (XCLI). We describe the performance characteristics of the XIV Storage System and present options that are available for alerting and monitoring, including an enhanced secure remote support capability. This book is intended for IT professionals who want an understanding of the XIV Storage System. It also targets readers who need detailed advice on how to configure and use the system.

Numbersense: How to Use Big Data to Your Advantage

2013-07-12 · O'Reilly Data Science Books O'Reilly Amazon

book

by Kaiser Fung

Analytics Big Data Marketing SAS data-science data-science-tasks stata statistics

How to make simple sense of complex statistics--from the author of Numbers Rule Your World We live in a world of Big Data--and it's getting bigger every day. Virtually every choice we make hinges on how someone generates data . . . and how someone else interprets it--whether we realize it or not. Where do you send your child for the best education? Big Data. Which airline should you choose to ensure a timely arrival? Big Data. Who will you vote for in the next election? Big Data. The problem is, the more data we have, the more difficult it is to interpret it. From world leaders to average citizens, everyone is prone to making critical decisions based on poor data interpretations. In Numbersense, expert statistician Kaiser Fung explains when you should accept the conclusions of the Big Data "experts"--and when you should say, "Wait . . . what?" He delves deeply into a wide range of topics, offering the answers to important questions, such as: How does the college ranking system really work? Can an obesity measure solve America's biggest healthcare crisis? Should you trust current unemployment data issued by the government? How do you improve your fantasy sports team? Should you worry about businesses that track your data? Don't take for granted statements made in the media, by our leaders, or even by your best friend. We're on information overload today, and there's a lot of bad information out there. Numbersense gives you the insight into how Big Data interpretation works--and how it too often doesn't work. You won't come away with the skills of a professional statistician. But you will have a keen understanding of the data traps even the best statisticians can fall into, and you'll trust the mental alarm that goes off in your head when something just doesn't seem to add up. Praise for Numbersense " Numbersense correctly puts the emphasis not on the size of big data, but on the analysis of it. Lots of fun stories, plenty of lessons learned—in short, a great way to acquire your own sense of numbers!" Thomas H. Davenport, coauthor of Competing on Analytics and President’s Distinguished Professor of IT and Management, Babson College "Kaiser’s accessible business book will blow your mind like no other. You’ll be smarter, and you won’t even realize it. Buy. It. Now." Avinash Kaushik, Digital Marketing Evangelist, Google, and author, Web Analytics 2.0 "Each story in Numbersense goes deep into what you have to think about before you trust the numbers. Kaiser Fung ably demonstrates that it takes skill and resourcefulness to make the numbers confess their meaning." John Sall, Executive Vice President, SAS Institute "Kaiser Fung breaks the bad news—a ton more data is no panacea—but then has got your back, revealing the pitfalls of analysis with stimulating stories from the front lines of business, politics, health care, government, and education. The remedy isn’t an advanced degree, nor is it common sense. You need Numbersense." Eric Siegel, founder, Predictive Analytics World, and author, Predictive Analytics "I laughed my way through this superb-useful-fun book and learned and relearned a lot. Highly recommended!" Tom Peters, author of In Search of Excellence

Surviving the Top Ten Challenges of Software Testing: A People-Oriented Approach

2013-07-12 · O'Reilly Data Science Books O'Reilly Amazon

book

by William E. Perry , Randall W. Rice

a-b-testing a/b testing data-science data-science-tasks

This is the digital version of hte printed book (Copyright © 1997). Software testers require technical and political skills to survive what can often be a lose-lose relationship with developers and managers. Whether testing is your specialty or your stepping stone to a career as a developer, there's no better way to survive the pressures put on testers than to meet the ten challenges described in this practical handbook. This book goes beyond the technical skills required for effective testing to address the political realities that can't be solved by technical knowledge alone. Communication and negotiation skills must be in every tester's tool kit. Authors Perry and Rice compile a "top ten" list of the challenges faced by testers and offer tactics for success. They combine their years of experience in developing testing processes, writing books and newsletters on testing, and teaching seminars on how to test. The challenges are addressed in light of the way testing fits into the context of software development and how testers can maximize their relationships with managers, developers, and customers. In fact, anyone who works with software testers should read this book for insight into the unique pressures put on this part of the software development process. "Somewhere between the agony of rushed deadlines and the luxury of all the time in the world has got to be a reasonable approach to testing."—from Chapter 8 The Top Ten People Challenges Facing Testers Challenge #10: Getting Trained in Testing Challenge #9: Building Relationships with Developers Challenge #8: Testing Without Tools Challenge #7: Explaining Testing to Managers Challenge #6: Communicating with Customers—And Users Challenge #5: Making Time for Testing Challenge #4: Testing What's Thrown Over the Wall Challenge #3: Hitting a Moving Target Challenge #2: Fighting a Lose-Lose Situation Challenge #1: Having to Say No

Enterprise Data Workflows with Cascading

2013-07-11 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Paco Nathan

Big Data Hadoop Java data-engineering

There is an easier way to build Hadoop applications. With this hands-on book, you’ll learn how to use Cascading, the open source abstraction framework for Hadoop that lets you easily create and manage powerful enterprise-grade data processing applications—without having to learn the intricacies of MapReduce. Working with sample apps based on Java and other JVM languages, you’ll quickly learn Cascading’s streamlined approach to data processing, data filtering, and workflow optimization. This book demonstrates how this framework can help your business extract meaningful information from large amounts of distributed data. Start working on Cascading example projects right away Model and analyze unstructured data in any format, from any source Build and test applications with familiar constructs and reusable components Work with the Scalding and Cascalog Domain-Specific Languages Easily deploy applications to Hadoop, regardless of cluster location or data size Build workflows that integrate several big data frameworks and processes Explore common use cases for Cascading, including features and tools that support them Examine a case study that uses a dataset from the Open Data Initiative

Software Development on the SAP HANA Platform

2013-07-11 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Mark Walker (Nue)

Data Management SAP data-engineering relational-databases

Software Development on the SAP HANA Platform equips you with all the knowledge you need to master developing on this high-performance in-memory technology. From setup and installation to deploying fully functional HANA applications, this book guides you step by step. With hands-on chapters, you'll gain the analytical tools and data management proficiency needed to excel. What this Book will help me do Set up a SAP HANA development environment from scratch. Successfully execute your first development project on SAP HANA. Utilize each type of view in SAP HANA effectively for data manipulation. Create users with appropriate authorizations for reporting purposes. Deploy reporting applications to end-user software seamlessly. Author(s) Mark Walker is a seasoned expert in SAP HANA, with years of professional experience in enterprise software development and training. He brings a passion for teaching complex technologies in an approachable and practical way. Mark's hands-on approach ensures that readers not only learn but can confidently apply their new skills. Who is it for? This book is designed for software developers and data professionals looking to expand their expertise with SAP HANA. It is ideal for those new to this platform or professionals enhancing their analytical and data management skills. Whether you're starting from scratch or upgrading your capabilities, this book suits your needs. The lessons here will assist in reaching your SAP HANA proficiency goals.

Decision Trees for Analytics Using SAS Enterprise Miner

2013-07-10 · O'Reilly Data Science Books O'Reilly Amazon

book

by Padraic Neville , Barry de Ville

Analytics BI SAS analytics-platforms data-science

Decision Trees for Analytics Using SAS Enterprise Miner is the most comprehensive treatment of decision tree theory, use, and applications available in one easy-to-access place. This book illustrates the application and operation of decision trees in business intelligence, data mining, business analytics, prediction, and knowledge discovery. It explains in detail the use of decision trees as a data mining technique and how this technique complements and supplements data mining approaches such as regression, as well as other business intelligence applications that incorporate tabular reports, OLAP, or multidimensional cubes.

An expanded and enhanced release of Decision Trees for Business Intelligence and Data Mining Using SAS Enterprise Miner, this book adds up-to-date treatments of boosting and high-performance forest approaches and rule induction. There is a dedicated section on the most recent findings related to bias reduction in variable selection. It provides an exhaustive treatment of the end-to-end process of decision tree construction and the respective considerations and algorithms, and it includes discussions of key issues in decision tree practice.

Analysts who have an introductory understanding of data mining and who are looking for a more advanced, in-depth look at the theory and methods of a decision tree approach to business intelligence and data mining will benefit from this book.

This book is part of the SAS Press program.

IBM Information Server: Integration and Governance for Emerging Data Warehouse Demands

2013-07-10 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Holger Kache , Manish Bhide , Bob Kitzberger , Harald C. Smith , Chuck Ballard , Yeh-Heng Sheng , Beate Porst

BI Big Data Data Governance Data Quality DWH IBM data-engineering

This IBM® Redbooks® publication is intended for business leaders and IT architects who are responsible for building and extending their data warehouse and Business Intelligence infrastructure. It provides an overview of powerful new capabilities of Information Server in the areas of big data, statistical models, data governance and data quality. The book also provides key technical details that IT professionals can use in solution planning, design, and implementation.

Step-by-Step Programming with Base SAS 9.4

2013-07-10 · O'Reilly Data Science Books O'Reilly Amazon

book

by SAS Institute SAS

SAS analytics-platforms data-science

Provides conceptual information about the SAS programming language, as well as step-by-step examples that illustrate the concepts.

Database Cloud Storage

2013-07-06 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Prasad Bagal , Nitin Vengurlekar

Cloud Computing Cloud Storage Oracle Cyber Security cloud-storage data-engineering storage-repositories

Implement a Centralized Cloud Storage Infrastructure with Oracle Automatic Storage Management Build and manage a scalable, highly available cloud storage solution. Filled with detailed examples and best practices, this Oracle Press guide explains how to set up a complete cloud-based storage system using Oracle Automatic Storage Management. Find out how to prepare hardware, build disk groups, efficiently allocate storage space, and handle security. Database Cloud Storage: The Essential Guide to Oracle Automatic Storage Management shows how to monitor your system, maximize throughput, and ensure consistency across servers and clusters. Set up and configure Oracle Automatic Storage Management Discover and manage disks and establish disk groups Create, clone, and administer Oracle databases Consolidate resources with Oracle Private Database Cloud Control access, encrypt files, and assign user privileges Integrate replication, file tagging, and automatic failover Employ pre-engineered private cloud database consolidation tools Check for data consistency and resync failed disks Code examples in the book are available for download

Learning SPARQL, 2nd Edition

2013-07-03 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Bob DuCharme

Big Data data-engineering sparql

Gain hands-on experience with SPARQL, the RDF query language that’s bringing new possibilities to semantic web, linked data, and big data projects. This updated and expanded edition shows you how to use SPARQL 1.1 with a variety of tools to retrieve, manipulate, and federate data from the public web as well as from private sources. Author Bob DuCharme has you writing simple queries right away before providing background on how SPARQL fits into RDF technologies. Using short examples that you can run yourself with open source software, you’ll learn how to update, add to, and delete data in RDF datasets. Get the big picture on RDF, linked data, and the semantic web Use SPARQL to find bad data and create new data from existing data Use datatype metadata and functions in your queries Learn techniques and tools to help your queries run more efficiently Use RDF Schemas and OWL ontologies to extend the power of your queries Discover the roles that SPARQL can play in your applications

Apache Sqoop Cookbook

2013-07-02 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Kathleen Ting , Jarek Jarcec Cecho

Big Data DWH GitHub Hadoop Apache HBase Hive MySQL Netezza Oracle RDBMS SQL Teradata +3 more

Integrating data from multiple sources is essential in the age of big data, but it can be a challenging and time-consuming task. This handy cookbook provides dozens of ready-to-use recipes for using Apache Sqoop, the command-line interface application that optimizes data transfers between relational databases and Hadoop. Sqoop is both powerful and bewildering, but with this cookbook’s problem-solution-discussion format, you’ll quickly learn how to deploy and then apply Sqoop in your environment. The authors provide MySQL, Oracle, and PostgreSQL database examples on GitHub that you can easily adapt for SQL Server, Netezza, Teradata, or other relational systems. Transfer data from a single database table into your Hadoop ecosystem Keep table data and Hadoop in sync by importing data incrementally Import data from more than one database table Customize transferred data by calling various database functions Export generated, processed, or backed-up data from Hadoop to your database Run Sqoop within Oozie, Hadoop’s specialized workflow scheduler Load data into Hadoop’s data warehouse (Hive) or database (HBase) Handle installation, connection, and syntax issues common to specific database vendors

Developing Business Intelligence Apps for SharePoint

2013-07-02 · O'Reilly Business Intelligence Books O'Reilly Amazon

book

by David Feldman (Takeda) , Jason Himmelstein

BI Data Modelling DataViz Microsoft SQL SSRS business-intelligence data-science

Create dynamic business intelligence (BI) solutions for SharePoint faster and with more capabilities than previously possible. With this book, you’ll learn the entire process—from high-level concepts to development and deployment—for building data-rich BI applications with Visual Studio LightSwitch, SQL Server 2012, and a host of related Microsoft technologies. You’ll learn practical techniques and patterns necessary to use all of these technologies together as you build an example application through the course of the book, step by step. Discover how to solve real problems, using BI solutions that will evolve to meet future needs. Learn the fundamentals of SharePoint, LightSwitch, and SQL Server 2012 Get a solid grounding in BI application basics and database design principles Use LightSwitch to build a help desk app, including data model design and SharePoint data integration Build a tabular cube with Microsoft’s Business Intelligence Semantic Model (BISM) Dive into the data visualization stack, including Excel and SQL Server Reporting Services Create reports with Excel Services, Report Builder, and PowerView Use tips and tricks for setting up your BI application development environment

talk-data.com

Activity Trend

Top Events

Top Speakers

Practical Anonymity

Segmentation and Lifetime Value Models Using SAS

Data Model Patterns

Pro Oracle Database 12c Administration, Second Edition

RMAN Recipes for Oracle Database 12c: A Problem-Solution Approach, Second Edition

Apache Flume: Distributed Log Collection for Hadoop

Microsoft Access 2013 Inside Out

IBM Flex System and PureFlex System Network Implementation with Juniper Networks

IBM XIV Storage System Gen3: Architecture, Implementation, and Usage

Numbersense: How to Use Big Data to Your Advantage

Surviving the Top Ten Challenges of Software Testing: A People-Oriented Approach

Enterprise Data Workflows with Cascading

Software Development on the SAP HANA Platform

Decision Trees for Analytics Using SAS Enterprise Miner

IBM Information Server: Integration and Governance for Emerging Data Warehouse Demands

Step-by-Step Programming with Base SAS 9.4

Database Cloud Storage

Learning SPARQL, 2nd Edition

Apache Sqoop Cookbook

Developing Business Intelligence Apps for SharePoint