Please note: In order to keep Hive up to date and provide users with the best features, we are no longer able to fully support Internet Explorer. The site is still available to you, however some sections of the site may appear broken. We would encourage you to move to a more modern browser like Firefox, Edge or Chrome in order to experience the site fully.

Bad Data Handbook : Cleaning Up the Data So You Can Get Back to Work, Paperback / softback Book

Bad Data Handbook : Cleaning Up the Data So You Can Get Back to Work Paperback / softback

Paperback / softback

Description

Welcome to data science's dirty secret: real-world data is messy.

Data scientists must spend a good deal of time playing software developer, writing code to clean up data before they can actually do anything constructive with it.

It's a necessary evil, but you can still make the most of it.

This practical book walks you through several real-world examples to demonstrate the theory and practice behind working with and cleaning up dirty data.

No one tool solves all of the problems well. Wise data scientists learn many tools and learn where each one shines.

To that end, this book takes a polyglot approach: most examples will involve R and Python, but expect the occasional smattering of Groovy and sed/awk fun.

Information

Other Formats

Save 19%

£31.99

£25.85

 
Free Home Delivery

on all orders

 
Pick up orders

from local bookshops

Information