Tuesday, February 07, 2012

Follow us on ...

Bridgewater Courier News/Home News Tribune

The qualifications the central New Jersey Courier News attached to the Social Security Death Records Index not only qualified the data, it provided a whole new perspective.

http://www.mycentraljersey.com/apps/pbcs.dll/section?category=datauniverse01

image

The Salt Lake Tribune

The Salt Lake Tribune does a particularly fine job of helping readers understand quirks in the data. Each of the major database entries is preceded by an explanatory text that includes its limitations. On Childcare Citations, the site offered this explanation: “Approximately 3,700 records of citations, infractions and deficiencies for over 1,000 childcare and daycare providers licensed in Salt Lake County for approximately the last 3 years.

http://www.utahsright.com/

image

Caveats and explanations

Indianapolis database expert Mark Nichols said it best in a column of tips for his Data Central readers: “All data is dirty.” Raw data often contains errors, and it also has limitations. “The dataset may be a snapshot in time and may not reflect what’s happening right now. In other cases, it may only be a sampling of people or incidents, and may not reflect the full picture. In all cases, it’s just one source of information, not a ‘be-all-end-all’” source,” he wrote.

His advice to readers was to find all they can about a dataset so they can draw the best possible conclusions from it. Our advice to database managers is to give readers a solid briefing on the strengths and weaknesses of each database offered. Here are two we found in our survey.

Copyright 2010 by ASNE.org