talk-data.com talk-data.com

Topic

API

Application Programming Interface (API)

integration software_development data_exchange

52

tagged

Activity Trend

65 peak/qtr
2020-Q1 2026-Q1

Activities

Showing filtered results

Filtering by: O'Reilly Data Science Books ×
Web Scraping with Python

Learn web scraping and crawling techniques to access unlimited data from any web source in any format. With this practical guide, you’ll learn how to use Python scripts and web APIs to gather and process data from thousands—or even millions—of web pages at once. Ideal for programmers, security professionals, and web administrators familiar with Python, this book not only teaches basic web scraping mechanics, but also delves into more advanced topics, such as analyzing raw data or using scrapers for frontend website testing. Code samples are available to help you understand the concepts in practice.

Visio Services Quick Guide: Using Visio with Sharepoint 2013 and Office 365

In this fast-paced 100-page guide, you’ll learn to load, display and interact with dynamic, data-powered Visio diagrams in SharePoint 2013 or Office 365. Visio Services Quick Guide gives you the tools to build anything from a simple project workflow to an organizational infrastructure diagram, powered by real data from SharePoint or SQL Server. Colleagues can load your diagrams entirely in the browser, meaning that a single Visio client installation is enough to get started. Readers with JavaScript experience will also find out how to get additional control over Visio diagrams using the JavaScript mashup API, and how to build a custom data provider. The final chapter covers some useful information on administering Visio Services. Get started bringing your Visio diagrams to life with the Visio Services Quick Guide.

Data Science at the Command Line

This hands-on guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You’ll learn how to combine small, yet powerful, command-line tools to quickly obtain, scrub, explore, and model your data. To get you started—whether you’re on Windows, OS X, or Linux—author Jeroen Janssens introduces the Data Science Toolbox, an easy-to-install virtual environment packed with over 80 command-line tools. Discover why the command line is an agile, scalable, and extensible technology. Even if you’re already comfortable processing data with, say, Python or R, you’ll greatly improve your data science workflow by also leveraging the power of the command line. Obtain data from websites, APIs, databases, and spreadsheets Perform scrub operations on plain text, CSV, HTML/XML, and JSON Explore data, compute descriptive statistics, and create visualizations Manage your data science workflow using Drake Create reusable tools from one-liners and existing Python or R code Parallelize and distribute data-intensive pipelines using GNU Parallel Model data with dimensionality reduction, clustering, regression, and classification algorithms

Mondrian in Action

Mondrian in Action teaches business users and developers how to use Mondrian and related tools for strategic business analysis. You'll learn how to design and populate a data warehouse and present the data via a multidimensional model. You'll follow examples showing how to create a Mondrian schema and then expand it to add basic security based on the users' roles. About the Technology Mondrian is an open source, lightning-fast data analysis engine designed to help you explore your business data and perform speed-of-thought analysis. Mondrian can be integrated into a wide variety of business analysis applications and learning it requires no specialized technical knowledge. About the Book Mondrian in Action teaches you to use Mondrian for strategic business analysis. In it, you'll learn how to organize and present data in a multidimensional manner. You'll follow apt and thoroughly explained examples showing how to create a Mondrian schema and then expand it to add basic security based on users' roles. Developers will discover how to integrate Mondrian using its olap4j Java API and web service calls via XML for Analysis. What's Inside Mondrian from the ground up -- no experience required A primer on business analytics Using Mondrian with a variety of leading applications Optimizing and restricting business data for fast, secure analysis About the Reader Written for developers building data analysis solutions. Appropriate for tech-savvy business users and DBAs needing to query and report on data. About the Authors William D. Back is an Enterprise Architect and Director of Pentaho Services. Nicholas Goodman is a Business Intelligence pro who has authored training courses on OLAP and Mondrian. Julian Hyde founded Mondrian and is the project's lead developer. Quotes A wonderful introduction to Business Intelligence and Analytics. - Lorenzo De Leon, Authentify, Inc. A great overview of the Mondrian engine that guided me through all the technical details. - Alexander Helf, veenion GmbH A significant complement to the online documentation, and an excellent introduction to how to think about designing a data warehouse. - Mark Newman, Heads Up Analytics Comprehensive ... highly recommended. - Najib Coutya, IMD Group

Integrating and Extending BIRT

The world-wide developer community has downloaded over ten million copies of BIRT (Business Intelligence and Reporting Tools). Built on the open-source Eclipse platform, BIRT is a powerful reporting system that provides an end-to-end solution from creating and deploying reports to integrating report capabilities in enterprise applications. introduces programmers to BIRT architecture and the reporting framework. BIRT technology makes it possible for programmers to build customized reports using scripting and BIRT APIs. This book also includes extensive examples of how to use the Eclipse Plug-in Development Environment to build plug-ins to extend the features of the BIRT framework. The source code for these examples is available for download at www.eclipse.org/birt. Integrating and Extending BIRT, Third Edition, Key topics covered include Installing and deploying BIRT Deploying a BIRT report to an application server Understanding BIRT architecture Scripting in a BIRT report design Integrating BIRT functionality into applications Working with the BIRT extension framework This revised and expanded third edition features the following new content Updated architectural diagrams Expanded scripting examples Debugging event handlers Developing an advanced report item with data binding Developing a data extraction extension Developing a charting extension Readers may also be interested in this book’s companion volume. BIRT: A Field Guide, Third Edition, is the authoritative guide to using BIRT Report Designer, the graphical tool that enables users of all levels to build reports, from simple to complex, without programming.

Mining the Social Web

Popular social networks such as Facebook and Twitter generate a tremendous amount of valuable data on topics and use patterns. Who's talking to whom? What are they talking about? How often are they talking? This concise and practical book shows you how to answer these questions and more by harvesting and analyzing data using social web APIs, Python, and pragmatic storage technologies such as Redis, CouchDB, and NetworkX. With Mining the Social Web, intermediate to advanced programmers will learn how to harvest and analyze social data in way that lends itself to hacking as well as more industrial-strength analysis. Algorithms are designed with robustness and efficiency in mind so that the approaches scale well on an ordinary piece of commodity hardware. The book is highly readable from cover to cover as content progressively grows in complexity, but also lends itself to being read in an ad-hoc fashion. Use easily adaptable scripts to access popular social network APIs including Twitter, OpenSocial, and Facebook Learn approaches for slicing and dicing social data that's been harvested from social web APIs as well as other common formats such as email and markup formats Harvest data from other sources such as Freebase and other sites to enrich your analytic capabilities with additional context Visualize and analyze data in interactive ways with tools built upon rich UI JavaScript toolkits Get a concise and straightforward synopsis of some practical technologies from the semantic web landscape that you can incorporate into your analysis This book is still in progress, but you can get going on this technology through our Rough Cuts edition, which lets you read the manuscript as it's being written, either online or via PDF.

21 Recipes for Mining Twitter

Millions of public Twitter streams harbor a wealth of data, and once you mine them, you can gain some valuable insights. This short and concise book offers a collection of recipes to help you extract nuggets of Twitter information using easy-to-learn Python tools. Each recipe offers a discussion of how and why the solution works, so you can quickly adapt it to fit your particular needs. The recipes include techniques to: Use OAuth to access Twitter data Create and analyze graphs of retweet relationships Use the streaming API to harvest tweets in realtime Harvest and analyze friends and followers Discover friendship cliques Summarize webpages from short URLs This book is a perfect companion to O’Reilly's Mining the Social Web.

A Developer’s Guide to Amazon SimpleDB

The Complete Guide to Building Cloud Computing Solutions with Amazon SimpleDB Using SimpleDB, any organization can leverage Amazon Web Services (AWS), Amazon’s powerful cloud-based computing platform–and dramatically reduce the cost and resources associated with application infrastructure. Now, for the first time, there’s a complete developer’s guide to building production solutions with Amazon SimpleDB. Pioneering SimpleDB developer Mocky Habeeb brings together all the hard-to-find information you need to succeed. Mocky tours the SimpleDB platform and APIs, explains their essential characteristics and tradeoffs, and helps you determine whether your applications are appropriate for SimpleDB. Next, he walks you through all aspects of writing, deploying, querying, optimizing, and securing Amazon SimpleDB applications–from the basics through advanced techniques. Throughout, Mocky draws on his unsurpassed experience supporting developers on SimpleDB’s official Web forums. He offers practical tips and answers that can’t be found anywhere else, and presents extensive working sample code–from snippets to complete applications. With A Developer’s Guide to Amazon SimpleDB you will be able to Evaluate whether a project is suited for Amazon SimpleDB Write SimpleDB applications that take full advantage of SimpleDB’s availability, scalability, and flexibility Effectively manage the entire SimpleDB application lifecycle Deploy cloud computing applications faster and more easily Work with SELECT and bulk data operations Fine tune queries to optimize performance Integrate SimpleDB security into existing organizational security plans Write and enhance runtime SimpleDB clients Build complete applications using AJAX and SimpleDB Understand low-level issues involved in writing clients and frameworks Solve common SimpleDB usage problems and avoid hidden pitfalls This book will be an indispensable resource for every IT professional evaluating or using SimpleDB to build cloud-computing applications, clients, or frameworks.

Advanced Web Metrics with Google Analytics™, Second Edition

Valuable tips and tricks for using the latest version of Google Analytics Packed with insider tips and tricks, this how-to guide is fully revised to cover the latest version of Google Analytics and shows you how to implement proven Web analytics methods and concepts. This second edition of the bestselling Advanced Web Metrics with Google Analytics is the perfect book for marketers, vendors, consultants, and Webmasters who want to learn the installation, configuration, tracking techniques, and best practices of Google Analytics. Google Analytics is a free tool that measures Web site effectiveness and helps users better understand how web site performance; this book is a detailed usage guide written by one of the software's original creators Explains what filters keep data accurate, how to measure Flash usage and tag for e-mail marketing, and what visitor segmentation provides the most useful feedback Examines principles and practices of Web analytics, then shows how to use GA's reports and how to track dynamic Web pages, banners, outgoing links, and contact forms Discusses advanced setups for configuring goals and filters, how to integrate GA with third-party systems, and how to leverage the new API Advanced Web Metrics with Google Analytics, Second Edition is valuable for both novice and experienced users of Google Analytics.

Integrating and Extending BIRT

The world-wide developer community has downloaded over three million copies of BIRT (Business Intelligence and Reporting Tools) from the Eclipse web site. Built on the open-source Eclipse platform, BIRT is a powerful reporting system that provides an end-to-end solution, from creating and deploying reports to integrating report capabilities in enterprise applications. The second of a two-book series on business intelligence and reporting technology, introduces programmers to BIRT architecture and the reporting framework. BIRT technology makes it possible for a programmer to build a customized report using scripting and BIRT APIs. A programmer can also extend the BIRT framework by creating a new plug-in using the Eclipse Plug-in Development Environment. This book provides extensive examples on how to build plug-ins to extend the features of the BIRT framework. The source code for these examples is available for download. Integrating and Extending BIRT, Second Edition The topics discussed include Installing and deploying BIRT Deploying a BIRT report to an application server Understanding BIRT architecture Scripting in a BIRT report design Integrating BIRT functionality into applications Working with the BIRT extension framework This second edition, revised and expanded, adds the following new content Updated architectural diagrams Expanded scripting examples Tag library descriptions In-depth description of BIRT Web Viewer Configuring BIRT to use a JNDI connection XML report rendering plug-in example Fragment plug-in localization example Open Data Access (ODA) plug-in example implementing the new Data Tools Platform (DTP) design and run-time wizards

Integrating and Extending BIRT

A new edition of this title is available, ISBN-10: 0321580303 ISBN-13: 9780321580306 The second of a two-book series about business intelligence and reporting technology, introduces programmers to BIRT architecture and the reporting framework. Built on the Eclipse platform, this exciting technology makes it possible for programmers to build and deploy customized reports using scripting and BIRT APIs. In addition, programmers can extend the BIRT framework by building new plug-ins using the Eclipse Plug-in Development Environment. Integrating and Extending BIRT This book describes the key components of BIRT architecture, applications, and engines, including installing, deploying, and troubleshooting the Reporting and Charting engines in an Enterprise Java application-server environment.For developers who wish to incorporate custom business logic in their reports or access data from Java objects, the book describes BIRT's extensive scripting capabilities in JavaScript and Java. For developers who want to extend the BIRT framework, the book describes the key extension points to use in creating customized report items, rendering extensions for generating output other than HTML or PDF, and Open Data Access (ODA) drivers for new data sources. The book provides extensive examples of how to build plug-ins to extend the features of the BIRT framework. Topics discussed include Installing and deploying BIRT Deploying a BIRT report to an application server Understanding BIRT architecture Scripting in a BIRT report design Integrating BIRT functionality in applications Working with the BIRT extension framework

Greasemonkey Hacks

Greasemonkey Hacks is an invaluable compendium 100 ingenious hacks for power users who want to master Greasemonkey, the hot new Firefox extension that allows you to write scripts that alter the web pages you visit. With Greasemonkey, you can create scripts that make a web site more usable, fix rendering bugs that site owners can't be bothered to fix themselves, or add items to a web site's menu bar. You can alter pages so they work better with technologies that speak a web page out loud or convert it to Braille. Greasemonkey gurus can even import, combine, and alter data from different web sites to meet their own specific needs. Greasemonkey has achieved a cult-like following in its short lifespan, but its uses are just beginning to be explored. Let's say you're shopping on an e-commerce site. You can create a script that will automatically display competitive prices for that particular product from other web sites. The possibilities are limited only by your imagination and your Greasemonkey expertise. Greasemonkey Hacks can't help you with the imagination part, but it can provide the expert hacks-complete with the sample code-you need to turn your brainstorms into reality. More than just an essential collection of made-to-order Greasemonkey solutions, Greasemonkey Hacks is crammed with sample code, a Greasemonkey API reference, and a comprehensive list of resources, to ensure that every resource you need is available between its covers. Some people are content to receive information from websites passively; some people want to control it. If you are one of the latter, Greasemonkey Hacks provides all the clever customizations and cutting-edge tips and tools you need to take command of any web page you view.