Archive

June 20, 2016

NY Times: Majority of our time is spent on data cleaning, not data analysis

Yet far too much handcrafted work — what data scientists call “data wrangling,” “data munging” and “data janitor work” — is still required. Data scientists, according to interviews and expert estimates, spend from 50 percent to 80 percent of their time mired in this more mundane labor of collecting and preparing unruly digital data, before it can be explored for useful nuggets. “Data wrangling is...

About me

Stephen

Professor and quant guy. Libertarian turned populist Republican. Trying to learn Japanese and play Spanish Baroque music on the ukulele.
