When is big data too big? Making data-based models comprehensible

From Mary Ann Liebert, Inc./Genetic Engineering News:

IMAGE: Big Data, published quarterly online with open access options and in print, facilitates and supports the efforts of researchers, analysts, statisticians, business leaders, and policymakers to improve operations, profitability, and… view more

Credit: ©Mary Ann Liebert, Inc., publishers

New Rochelle, July 11, 2016–Data-driven mathematical modeling is having an enormous impact on the ability to organize and describe very large data sets, and make inferences and predictions about populations and situations based on sampling data. However, as these models become increasingly complex, the ability of users to understand and apply them represents a growing challenge. The article “A Framework for Considering Comprehensibility in Modeling”, which describes this emerging dilemma and a strategy for developing solutions, is published in Big Data, the highly innovative, peer-reviewed journal from Mary Ann Liebert, Inc., publishers. The article is available free for download on the Big Data website until August 8, 2016.

Michael Gleicher, University of Wisconsin-Madison, defines comprehensibility as “the ability of the various stakeholders to understand relevant aspects of the modeling process.” He suggests that comprehensibility should be a key goal in model development. However, as models become more sophisticated, tradeoffs may be inevitable–even between understandability and accuracy–in some cases, improving comprehensibility may help achieve other goals in modeling.

“Gleicher provides a holistic framework of comprehensibility that considers what the various stakeholders in a data science project do and don’t understand easily and their need for comprehensibility,” says Big Data Editor-in-Chief Vasant Dhar, Professor at the Stern School of Business and the Center for Data Science at New York University. “More broadly, the article highlights comprehensibility from a human-centric standpoint, identifying the role and needs of humans in complex data science projects.”

About the Journal

Big Data, published quarterly …

Continue Reading