Originally posted on: http://geekswithblogs.net/TATWORTH/archive/2014/10/15/orsquoreilly-offer-on-hadoop-e-books-and-videos-until-0500-pt.aspx
To celebrate the Strata and Hadoop world, O’Reilly are offering 50% off a number of E-books and Videos at http://shop.oreilly.com/category/deals/strata-celebration.do?code=CFSTNY4
“Updated as of August 2014, this practical book will demonstrate proven methods for anonymizing health data to help your organization share meaningful datasets, without exposing patient identity. Leading experts Khaled El Emam and Luk Arbuckle walk you through a risk-based methodology, using case studies from their efforts to de-identify hundreds of datasets. Clinical data is valuable for research and other types of analytics, but making it anonymous without compromising data quality is tricky. This book demonstrates techniques for handling different data types, based on the authors’ experiences with a maternal-child registry, inpatient discharge abstracts, health insurance claims, electronic medical record databases, and the World Trade Center disaster registry, among others.”
Image may be NSFW.
Clik here to view.
“How can you get your data from frontend servers to Hadoop in near real time? With this complete reference guide, you’ll learn Flume’s rich set of features for collecting, aggregating, and writing large amounts of streaming data to the Hadoop Distributed File System (HDFS), Apache HBase, SolrCloud, Elastic Search, and other systems.”
Image may be NSFW.
Clik here to view.
Data Science at the Command Line
“This hands-on guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You’ll learn how to combine small, yet powerful, command-line tools to quickly obtain, scrub, explore, and model your data. To get you started—whether you’re on Windows, OS X, or Linux—author Jeroen Janssens introduces the Data Science Toolbox, an easy-to-install virtual environment packed with over 80 command-line tools”
Image may be NSFW.
Clik here to view.
“Why a book about logs? That’s easy: the humble log is an abstraction that lies at the heart of many systems, from NoSQL databases to cryptocurrencies. Even though most engineers don’t think much about them, this short book shows you why logs are worthy of your attention.”
Image may be NSFW.
Clik here to view.
Clik here to view.
