{"id":3012,"date":"2013-01-24T11:23:04","date_gmt":"2013-01-24T16:23:04","guid":{"rendered":"http:\/\/blog.law.cornell.edu\/voxpop\/?p=3012"},"modified":"2025-01-31T13:58:28","modified_gmt":"2025-01-31T18:58:28","slug":"metadata-quality-in-a-linked-data-context","status":"publish","type":"post","link":"https:\/\/blog.law.cornell.edu\/voxpop\/2013\/01\/24\/metadata-quality-in-a-linked-data-context\/","title":{"rendered":"Metadata Quality in a Linked Data Context"},"content":{"rendered":"<h1 style=\"font-family: Times;\"><span style=\"font-family: arial,helvetica,sans-serif; font-size: large;\">Van Winkle wakes<\/span><\/h1>\n<p style=\"font-family: Times;\"><span style=\"font-size: 15px; font-family: Arial; vertical-align: baseline;\">In this post, we return to a topic we first visited in a book chapter in 2004. \u00a0At that time, one of us (Bruce) was an electronic publisher of Federal court cases and statutes, and the other (Hillmann, herself a former law cataloger) was working with large, aggregated repositories of scientific papers as part of the National Sciences Digital Library project. \u00a0Then, as now, we were concerned that little attention was being paid to the practical tradeoffs involved in publishing high quality metadata at low cost. \u00a0There was a tendency to design metadata schemas that said absolutely everything that <\/span><span style=\"font-size: 15px; font-family: Arial; font-style: italic; vertical-align: baseline;\">could<\/span><span style=\"font-size: 15px; font-family: Arial; vertical-align: baseline;\"> be said about an object, often at the expense of obscuring what <\/span><span style=\"font-size: 15px; font-family: Arial; font-style: italic; vertical-align: baseline;\">needed<\/span><span style=\"font-size: 15px; font-family: Arial; vertical-align: baseline;\"> to be said about it while running up unacceptable costs. \u00a0Though we did not have a name for it at the time, we were already deeply interested in least-cost, use-case-driven approaches to the design of metadata models, and that naturally led us to wonder what \u201cgood\u201d metadata might be. \u00a0The result was \u201c<a href=\"http:\/\/www.ecommons.cornell.edu\/handle\/1813\/7895\">The Continuum of Metadata Quality: Defining, Expressing, Exploiting<\/a>\u201d, published as a chapter in an ALA publication, <\/span><span style=\"font-size: 15px; font-family: Arial; font-style: italic; vertical-align: baseline;\">Metadata in Practice<\/span><span style=\"font-size: 15px; font-family: Arial; vertical-align: baseline;\">.<\/span><\/p>\n<p><span style=\"font-size: 15px; font-family: Arial; vertical-align: baseline;\">In that chapter, we attempted to create a framework for talking about (and evaluating) metadata quality. \u00a0We were concerned primarily with metadata as we were then encountering it: in aggregations of repositories containing scientific preprints, educational resources, and in caselaw and other primary legal materials published on the Web. \u00a0\u00a0We hoped we could create something that would be both domain-independent and useful to those who manage and evaluate metadata projects. \u00a0Whether or not we succeeded is for others to judge. <\/span><\/p>\n<h1><span style=\"font-size: 15px; font-family: Arial; vertical-align: baseline;\"><span style=\"font-size: large;\">The Original Framework<\/span><br \/>\n<\/span><\/h1>\n<p style=\"font-family: Times;\"><span style=\"font-size: 15px; font-family: Arial; vertical-align: baseline;\">At that time, we identified seven major components of metadata quality. Here, we reproduce a part of a summary table that we used to characterize the seven measures. We suggested questions that might be used to draw a bead on the various measures we proposed:<\/span><\/p>\n<table style=\"border: none;\">\n<tbody>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; font-weight: bold; vertical-align: baseline;\">Quality Measure<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; font-weight: bold; vertical-align: baseline;\">Quality Criteria<\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; font-style: italic; vertical-align: baseline;\">Completeness<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Does the element set completely describe the objects?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Are all relevant elements used for each object?<\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; font-style: italic; vertical-align: baseline;\">Provenance<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Who is responsible for creating, extracting, or transforming the metadata?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">How was the metadata created or extracted?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">What transformations have been done on the data since its creation?<\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; font-style: italic; vertical-align: baseline;\">Accuracy<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Have accepted methods been used for creation or extraction?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">What has been done to ensure valid values and structure?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Are default values appropriate, and have they been appropriately used?<\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; font-style: italic; vertical-align: baseline;\">Conformance to expectations<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Does metadata describe what it claims to?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Are controlled vocabularies aligned with audience characteristics and understanding of the objects?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Are compromises documented and in line with community expectations?<\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; font-style: italic; vertical-align: baseline;\">Logical consistency and coherence<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Is data in elements consistent throughout?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">How does it compare with other data within the community?<\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; font-style: italic; vertical-align: baseline;\">Timeliness<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Is metadata regularly updated as the resources change?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Are controlled vocabularies updated when relevant?<\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; font-style: italic; vertical-align: baseline;\">Accessibility<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Is an appropriate element set for audience and community being used?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Is it affordable to use and maintain?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Does it permit further value-adds?<\/span><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><span style=\"font-family: Arial; font-size: 15px; vertical-align: baseline;\">There are, of course, many possible elaborations of these criteria, and many other questions that help get at them. \u00a0Almost nine years later, we believe that the framework remains both relevant and highly useful, although (as we will discuss in a later section) we need to think carefully about whether and how it relates to the quality standards that the Linked Open Data (LOD) community is discovering for itself, and how it and other standards should affect library and publisher practices and policies.<\/span><\/p>\n<h1><span style=\"color: #000000; font-size: large;\"><strong><span style=\"font-family: Arial; vertical-align: baseline;\">\u2026 and the environment in which it was created<\/span><\/strong><\/span><\/h1>\n<p><span style=\"font-family: Arial; font-size: 15px; vertical-align: baseline;\">Our work was necessarily shaped by the environment we were in. \u00a0Though we never really said so explicitly, we were looking for quality not only in the data itself, but in the methods used to organize, transform and aggregate it across federated collections. \u00a0We did not, however, anticipate the speed or scale at which standards-based methods of data organization would be applied. \u00a0Commonly-used standards like <a href=\"http:\/\/www.google.com\/url?q=http%3A%2F%2Fxmlns.com%2Ffoaf%2Fspec%2F&amp;sa=D&amp;sntz=1&amp;usg=AFQjCNHowKlBMu-6T-JqCFpsUX0XLD-Ysg\">FOAF<\/a>, models such as those contained in <a href=\"http:\/\/schema.org\/docs\/full.html\">schema.org<\/a>, and lightweight modelling apparatus like <a href=\"http:\/\/www.google.com\/url?q=http%3A%2F%2Fwww.w3.org%2F2004%2F02%2Fskos%2F&amp;sa=D&amp;sntz=1&amp;usg=AFQjCNGVVYU994zHcO3Hr83A7h52xKbxCA\">SKOS<\/a> are all things that have emerged into common use since, and of course the use of Dublin Core &#8212; our main focus eight years ago &#8212; has continued even as the standard itself has been refined. \u00a0These days, an expanded toolset makes it even more important that we have a way to talk about how well the tools fit the job at hand, and how well they have been applied. An expanded set of design choices accentuates the need to talk about how well choices have been made in particular cases.<\/span><\/p>\n<p><span style=\"font-family: Arial; font-size: 15px; vertical-align: baseline;\">Although our work took its inspiration from <a href=\"http:\/\/www.google.com\/url?q=http%3A%2F%2Fwww.statcan.ca%2Fenglish%2Ffreepub%2F12-586-XIE%2F12-586-XIE02001.pdf&amp;sa=D&amp;sntz=1&amp;usg=AFQjCNHza6FjiES0tmMbLAuQxb6aZoHEog\">quality standards developed by a government statistical service<\/a>, we had not really thought through the sheer multiplicity of information services that were available even then. \u00a0We were concerned primarily with work that had been done with descriptive metadata in digital libraries, but of course there were, and are, many more people publishing and consuming data in both the governmental and private sectors (to name just two). \u00a0Indeed, there was already a substantial literature on data quality that arose from within the management information systems (MIS) community, driven by concerns about the reliability and quality of \u00a0mission-critical data used and traded by businesses. \u00a0In today\u2019s wider world, where work with library metadata will be strongly informed by the Linked Open Data techniques developed for a diverse array of data publishers, we need to take a broader view. \u00a0<\/span><\/p>\n<p><span style=\"font-family: Arial; font-size: 15px; vertical-align: baseline;\">Finally, we were driven then, as we are now, by managerial and operational concerns. As practitioners, we were well aware that metadata carries costs, and that human judgment is expensive. \u00a0We were looking for a set of indicators that would spark and sustain discussion about costs and tradeoffs. \u00a0At that time, we were mostly worried that libraries were not giving costs enough attention, and were designing metadata projects that were unrealistic given the level of detail or human intervention they required. \u00a0That is still true. \u00a0The world of Linked Data requires well-understood metadata policies and operational practices simply so publishers can know what is expected of them and consumers can know what they are getting. Those policies and practices in turn rely on quality measures that producers and consumers of metadata can understand and agree on. \u00a0In today\u2019s world &#8212; one in which institutional resources are shrinking rather than expanding &#8212; \u00a0human intervention in the metadata quality assessment process at any level more granular than that of the entire data collection being offered will become the exception rather than the rule. \u00a0\u00a0<\/span><\/p>\n<p><span style=\"font-family: Arial; font-size: 15px; vertical-align: baseline;\">While the methods we suggested at the time were self-consciously domain-independent, they did rest on background assumptions about the nature of the services involved and the means by which they were delivered. Our experience had been with data aggregated by communities where the data producers and consumers were to some extent known to one another, using a <a href=\"http:\/\/www.google.com\/url?q=http%3A%2F%2Fwww.openarchives.org%2Fpmh%2F&amp;sa=D&amp;sntz=1&amp;usg=AFQjCNG3OJOyL8cGBK9QfeSOQ4sH8JUofA\">fairly simple technology<\/a> that was easy to run and maintain. \u00a0In 2013, that is not the case; producers and consumers are increasingly remote from each other, and the technologies used are both more complex and less mature, though that is changing rapidly.<\/span><\/p>\n<p><span style=\"font-family: Arial; font-size: 15px; vertical-align: baseline;\">The remainder of this blog post is an attempt to reconsider our framework in that context.<\/span><\/p>\n<h1><span style=\"font-size: medium;\"><strong><span style=\"font-family: Arial; vertical-align: baseline;\">The New World<\/span><\/strong><\/span><\/h1>\n<p><span style=\"font-family: Arial; font-size: 15px; vertical-align: baseline;\"><img loading=\"lazy\" decoding=\"async\" class=\"alignleft\" style=\"margin: 5px;\" src=\"http:\/\/blog.law.cornell.edu\/voxpop\/files\/2013\/01\/lod-cloud_2010.png\" alt=\"\" width=\"150\" height=\"98\" \/>The Linked Open Data (LOD) community has begun to consider quality issues; there are some <a href=\"http:\/\/lists.w3.org\/Archives\/Public\/public-lod\/2011Apr\/0140.html\">noteworthy online discussions<\/a>, as well as <a href=\"http:\/\/pamir.dia.uniroma3.it:8080\/LWDM2011\/Home.html\">workshops<\/a> resulting in a number of <a href=\"http:\/\/dl.acm.org\/citation.cfm?id=1966901\">published papers and online resources<\/a>. \u00a0It is interesting to see where the work that has come from within the LOD community contrasts with the thinking of the library community on such matters, and where it does not. \u00a0<\/span><\/p>\n<p><span style=\"font-family: Arial; font-size: 15px; vertical-align: baseline;\">In general, the material we have seen leans toward the traditional data-quality concerns of the MIS community. \u00a0LOD practitioners seem to have started out by putting far more emphasis than we might on criteria that are essentially audience-dependent, and on operational concerns having to do with the reliability of publishing and consumption apparatus. \u00a0\u00a0As it has evolved, the discussion features an intellectual move away from those audience-dependent criteria, which are usually expressed as \u201cfitness for use\u201d, \u201crelevance\u201d, or something of the sort (we ourselves used the phrase \u201ccommunity expectations\u201d). Instead, most realize that both audience and usage \u00a0are likely to be (at best) partially unknown to the publisher, at least at system design time. \u00a0In other words, the larger community has begun to grapple with something librarians have known for a while: future uses and the extent of dissemination are impossible to predict. \u00a0There is a creative tension here that is not likely to go away. \u00a0On the one hand, data developed for a particular community is likely to be much more useful to that community; thus our initial recognition of the role of \u201ccommunity expectations\u201d. \u00a0On the other, dissemination of the data may reach far past the boundaries of the community that develops and publishes it. \u00a0The hope is that this tension can be resolved by integrating large data pools from diverse sources, or by taking other approaches that result in data models sufficiently large and diverse that \u201ccommunity expectations\u201d can be implemented, essentially, by filtering.<\/span><\/p>\n<p><span style=\"font-family: Arial; font-size: 15px; vertical-align: baseline;\">For the LOD community, the path that began with \u00a0\u201cfitness-for-use\u201d criteria led quickly to the idea of maintaining a \u201cneutral perspective\u201d. <a href=\"http:\/\/www.slideshare.net\/cfuerber\/towards-a-vocabulary-for-data-quality-management-in-semantic-web-architectures\">Christian F\u00fcrber describes that perspective<\/a> as the idea that \u201cData quality is the degree to which data meets quality requirements no matter who is making the requirements\u201d. \u00a0To librarians, who have long since given up on the idea of cataloger objectivity, a phrase like \u201cneutral perspective\u201d may seem naive. \u00a0But it is a step forward in dealing with data whose dissemination and user community is unknown. And it is important to remember that the larger LOD community is concerned with quality in data publishing in general, and not solely with descriptive metadata, for which objectivity may no longer be of much value. \u00a0For that reason, it would be natural to expect the larger community to place greater weight on objectivity in their quality criteria than the library community feels that it can, with a strong preference for quantitative assessment wherever possible. \u00a0Librarians and others concerned with data that involves human judgment are theoretically more likely to be concerned with issues of provenance, particularly as they concern who has created and handled the data. \u00a0And indeed that is the case. <\/span><\/p>\n<h1><span style=\"font-family: Arial; font-size: large; vertical-align: baseline;\">The new quality criteria, and how they stack up<\/span><\/h1>\n<p style=\"font-family: Times;\"><span style=\"font-size: 15px; font-family: Arial; vertical-align: baseline;\">Here is a simplified comparison of our 2004 criteria with three views taken from the LOD community.<\/span><\/p>\n<table style=\"border: none; width: 624px;\">\n<tbody>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; font-style: italic; vertical-align: baseline;\"><a href=\"http:\/\/www.ecommons.cornell.edu\/handle\/1813\/7895\">Bruce &amp; Hillmann<\/a><\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; font-style: italic; vertical-align: baseline;\"><a href=\"http:\/\/answers.semanticweb.com\/questions\/1072\/quality-indicators-for-linked-data-datasets\">Dodds<\/a><\/span><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">, <\/span><span style=\"font-size: 13px; font-family: Arial; font-style: italic; vertical-align: baseline;\"><a href=\"http:\/\/lists.w3.org\/Archives\/Public\/public-lod\/2011Apr\/0140.html\">McDonald<\/a><\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; font-style: italic; vertical-align: baseline;\"><a href=\"http:\/\/sourceforge.net\/apps\/mediawiki\/trdf\/index.php?title=Quality_Criteria_for_Linked_Data_sources&amp;action=history\">Flemming<\/a><\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Completeness<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Completeness<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Boundedness<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Typing<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Amount of data<\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Provenance<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">History<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Attribution<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Authoritative<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Verifiability<\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Accuracy<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Accuracy<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Typing<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Validity of documents<\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Conformance to expectations<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Modeling correctness<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Modeling granularity<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Isomorphism<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Uniformity<\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Logical consistency and coherence<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Directionality<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Modeling correctness<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Internal consistency<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Referential correspondence<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Connectedness<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Consistency<\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Timeliness<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Currency<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Timeliness<\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Accessibility <\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Intelligibility<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Licensing<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Sustainable<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Comprehensibility<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Versatility<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Licensing<\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Accessibility (technical)<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Performance (technical)<\/span><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><span style=\"font-family: Arial; font-size: 15px; vertical-align: baseline;\">Placing the \u201cnew\u201d criteria into our framework was no great challenge; it appears that we were, and are, talking about many of the same things. A few explanatory remarks:<\/span><\/p>\n<ul>\n<li><strong><strong><span style=\"font-style: italic; vertical-align: baseline;\">Boundedness<\/span><span style=\"vertical-align: baseline;\"> has roughly the same relationship to <\/span><span style=\"font-style: italic; vertical-align: baseline;\">completeness <\/span><span style=\"vertical-align: baseline;\">that precision does to recall in information-retrieval metrics. The data is complete when we have everything we want; its boundedness shows high quality when we have <\/span><span style=\"font-style: italic; vertical-align: baseline;\">only<\/span><span style=\"vertical-align: baseline;\"> what we want.<\/span><\/strong><\/strong><\/li>\n<li><strong><strong><span style=\"vertical-align: baseline;\">Flemming\u2019s <\/span><span style=\"font-style: italic; vertical-align: baseline;\">amount of data<\/span><span style=\"vertical-align: baseline;\"> criterion talks about numbers of triples and links, and about the interconnectedness and granularity of the data. \u00a0These seem to us to be largely completeness criteria, though things to do with linkage would more likely fall under \u201cLogical coherence\u201d in our world. Note, again, a certain preoccupation with things that are easy to count. \u00a0In this case it is somewhat unsatisfying; it\u2019s not clear what the number of triples in a triplestore says about quality, or how it might be related to completeness if indeed that is what is intended.<\/span><\/strong><\/strong><\/li>\n<li><strong><strong><span style=\"vertical-align: baseline;\">Everyone lists criteria that fit well with our notions about <\/span><span style=\"font-style: italic; vertical-align: baseline;\">provenance.<\/span><span style=\"vertical-align: baseline;\"> In that connection, the most significant development has been <a href=\"http:\/\/www.w3.org\/2005\/Incubator\/prov\/wiki\/Provenance_Vocabulary_Mappings\">a great deal of work on formalizing the ways in which provenance is expressed<\/a>. \u00a0This is still an active level of research, with a lot to be decided. \u00a0In particular, attempts at true domain independence are not fully successful, and will probably never be so. \u00a0It appears to us that those <a href=\"http:\/\/wiki.bib.uni-mannheim.de\/dc-provenance\/doku.php\">working on the problem at DCMI<\/a> are monitoring the other efforts and incorporating the most worthwhile features.<\/span><\/strong><\/strong><\/li>\n<li><strong><strong><span style=\"vertical-align: baseline;\">Dodds\u2019 <\/span><span style=\"font-style: italic; vertical-align: baseline;\">typing<\/span><span style=\"vertical-align: baseline;\"> criterion &#8212; which basically says that dereferenceable URIs should be preferred to string literals \u00a0&#8212; participates equally in completeness and accuracy categories. \u00a0While we prefer URIs in our models, we are a little uneasy with the idea that the presence of string literals is always a sign of low quality. \u00a0Under some circumstances, for example, they might simply indicate an early stage of vocabulary evolution.<\/span><\/strong><\/strong><\/li>\n<li><strong><strong><span style=\"vertical-align: baseline;\">Flemming\u2019s <\/span><span style=\"font-style: italic; vertical-align: baseline;\">verifiability <\/span><span style=\"vertical-align: baseline;\">and <\/span><span style=\"font-style: italic; vertical-align: baseline;\">validity<\/span><span style=\"vertical-align: baseline;\"> criteria need a little explanation, because the terms used are easily confused with formal usages and so are a little misleading. \u00a0Verifiability bundles a set of concerns we think of as provenance. \u00a0Validity of documents is about accuracy as it is found in things like class and property usage. \u00a0Curiously, none of Flemming\u2019s criteria have anything to do with whether the information being expressed by the data is correct in what it says about the real world; they are all designed to convey technical criteria. \u00a0The concern is not with what the data says, but with how it says it.<\/span><\/strong><\/strong><\/li>\n<li><strong><strong><span style=\"vertical-align: baseline;\">Dodds\u2019 <\/span><span style=\"font-style: italic; vertical-align: baseline;\">modeling correctness<\/span><span style=\"vertical-align: baseline;\"> criterion seems to be about two things: whether or not the model is correctly constructed in formal terms, and whether or not it covers the subject domain in an expected way. \u00a0Thus, we assign it to both \u201cCommunity expectations\u201d and \u201cLogical coherence\u201d categories.<\/span><\/strong><\/strong><\/li>\n<li><strong><strong><span style=\"font-style: italic; vertical-align: baseline;\">Isomorphism<\/span><span style=\"vertical-align: baseline;\"> has to do with the ability to join datasets together, when they describe the same things. \u00a0In effect, it is a more formal statement of the idea that a given community will expect different models to treat similar things similarly. But there are also some very tricky (and often abused) concepts of equivalence involved; these are just beginning to receive <a href=\"http:\/\/www.w3.org\/2009\/12\/rdf-ws\/papers\/ws21\">some attention from Semantic Web researchers<\/a>.<\/span><\/strong><\/strong><\/li>\n<li><strong><strong><span style=\"font-style: italic; vertical-align: baseline;\">Licensing<\/span><span style=\"vertical-align: baseline;\"> has become more important to everyone. That is in part because Linked Data as published in the private sector may exhibit some of the proprietary characteristics we saw as access barriers in 2004, and also because even public-sector data publishers are worried about cost recovery and appropriate-use issues. \u00a0We say more about this in a later section.<\/span><\/strong><\/strong><\/li>\n<li><strong><strong><span style=\"vertical-align: baseline;\">A number of criteria listed under <\/span><span style=\"font-style: italic; vertical-align: baseline;\">Accessibility<\/span><span style=\"vertical-align: baseline;\"> have to do with the reliability of data publishing and consumption apparatus as used in production. \u00a0Linked Data consumers want to know that the endpoints and triple stores they rely on for data are going to be up and running when they are needed. \u00a0That brings a whole set of accessibility and technical performance issues into play. \u00a0At least <a href=\"http:\/\/labs.mondeca.com\/sparqlEndpointsStatus\/\">one website exists for the sole purpose of monitoring endpoint reliability<\/a>, an obvious concern of those who build services that rely on Linked Data sources. Recently, the LII made a decision to run its own mirror of the DrugBank triplestore to eliminate problems with uptime and to guarantee low latency; performance and accessibility had become major concerns. For consumers, due diligence is important.<\/span><\/strong><\/strong><\/li>\n<\/ul>\n<p style=\"font-family: Times;\"><span style=\"font-size: 15px; font-family: Arial; vertical-align: baseline;\">For us, there is a distinctly different feel to the examples that Dodds, Flemming, and others have used to illustrate their criteria; they seem to be looking at a set of phenomena that has substantial overlap with ours, but is not quite the same. \u00a0Part of it is simply the fact, mentioned earlier, that data publishers in distinct domains have distinct biases. For example, those who can\u2019t fully believe in objectivity are forced to put greater emphasis on provenance. Others who are not publishing descriptive data that relies on human judgment feel they can rely on more \u00a0\u201cobjective\u201d assessment methods. \u00a0But the biggest difference in the \u201cnew quality\u201d is that it puts a great deal of emphasis on technical quality in the construction of the data model, and much less on how well the data that populates the model describes real things in the real world. \u00a0<\/span><\/p>\n<p><span style=\"font-size: 15px; font-family: Arial; vertical-align: baseline;\">There are three reasons for that. \u00a0The first has to do with the nature of the discussion itself. All quality discussions, simply as discussions, seem to neglect notions of factual accuracy because factual accuracy seems self-evidently a Good Thing; there\u2019s not much to talk about. \u00a0Second, the people discussing quality in the LOD world are modelers first, and so quality is seen as adhering primarily to the model itself. \u00a0Finally, the world of the Semantic Web rests on the assumption that \u201canyone can say anything about anything\u201d, For some, the egalitarian interpretation of that statement reaches the level of religion, making it very difficult to measure quality by judging whether something is factual or not; from a purist\u2019s perspective, it\u2019s opinions all the way down. \u00a0There is, then, a tendency to rely on formalisms and modeling technique to hold back the tide.<\/span><\/p>\n<p><span style=\"font-size: 15px; font-family: Arial; vertical-align: baseline;\">In 2004, we suggested a set of metadata-quality indicators suitable for managers to use in assessing projects and datasets. \u00a0An updated version of that table would look like this:<\/span><\/p>\n<p style=\"font-family: Times;\"><span style=\"font-size: 15px; font-family: Arial; vertical-align: baseline;\">\u00a0<\/span><\/p>\n<table style=\"border: none;\">\n<tbody>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; font-weight: bold; vertical-align: baseline;\">Quality Measure<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; font-weight: bold; vertical-align: baseline;\">Quality Criteria<\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; font-style: italic; vertical-align: baseline;\">Completeness<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Does the element set completely describe the objects?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Are all relevant elements used for each object?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Does the data contain everything you expect?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Does the data contain <\/span><span style=\"font-size: 13px; font-family: Arial; font-style: italic; vertical-align: baseline;\">only<\/span><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\"> what you expect?<\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; font-style: italic; vertical-align: baseline;\">Provenance<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Who is responsible for creating, extracting, or transforming the metadata?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">How was the metadata created or extracted?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">What transformations have been done on the data since its creation?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Has a dedicated provenance vocabulary been used?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Are there authenticity measures (eg. digital signatures) in place?<\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; font-style: italic; vertical-align: baseline;\">Accuracy<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Have accepted methods been used for creation or extraction?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">What has been done to ensure valid values and structure?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Are default values appropriate, and have they been appropriately used?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Are all properties and values valid\/defined?<\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; font-style: italic; vertical-align: baseline;\">Conformance to expectations<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Does metadata describe what it claims to?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Does the data model describe what it claims to?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Are controlled vocabularies aligned with audience characteristics and understanding of the objects?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Are compromises documented and in line with community expectations?<\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; font-style: italic; vertical-align: baseline;\">Logical consistency and coherence<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Is data in elements consistent throughout?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">How does it compare with other data within the community?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Is the data model technically correct and well structured?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Is the data model aligned with other models in the same domain?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Is the model consistent in the direction of relations?<\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; font-style: italic; vertical-align: baseline;\">Timeliness<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Is metadata regularly updated as the resources change?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Are controlled vocabularies updated when relevant?<\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; font-style: italic; vertical-align: baseline;\">Accessibility<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Is an appropriate element set for audience and community being used?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Is the data and its access methods well-documented, with exemplary queries and URIs?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Do things have human-readable labels?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Is it affordable to use and maintain?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Does it permit further value-adds?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Does it permit republication?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Is attribution required if the data is redistributed?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Are human- and machine-readable licenses available?<\/span><\/td>\n<\/tr>\n<tr style=\"height: 0px;\">\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; font-style: italic; vertical-align: baseline;\">Accessibility &#8212; technical<\/span><\/td>\n<td style=\"border: 1px solid #000000; vertical-align: top; padding: 7px;\"><span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Are reliable, performant endpoints available?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Will the provider guarantee service (eg. via a service level agreement)?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Is the data available in bulk?<\/span><br \/>\n<span style=\"font-size: 13px; font-family: Arial; vertical-align: baseline;\">Are URIs stable?<\/span><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><span style=\"font-family: Arial; font-size: 15px; vertical-align: baseline;\">The differences in the example questions reflect the differences of approach that we discussed earlier. Also, the new approach separates criteria related to <\/span><span style=\"font-family: Arial; font-size: 15px; font-style: italic; vertical-align: baseline;\">technical<\/span><span style=\"font-family: Arial; font-size: 15px; vertical-align: baseline;\"> accessibility from questions that relate to <\/span><span style=\"font-family: Arial; font-size: 15px; font-style: italic; vertical-align: baseline;\">intellectual<\/span><span style=\"font-family: Arial; font-size: 15px; vertical-align: baseline;\"> accessibility. Indeed, we suspect that \u201caccessibility\u201d may have been too broad a notion in the first place. Wider deployment of metadata systems and a much greater, still-evolving variety of producer-consumer scenarios and relationships have created a need to break it down further. \u00a0There are as many aspects to accessibility as there are types of barriers &#8212; economic, technical, and so on.<\/span><\/p>\n<p><span style=\"font-family: Arial; font-size: 15px; vertical-align: baseline;\">As before, our list is not a checklist or a set of must-haves, nor does it contain all the questions that might be asked. \u00a0Rather, we intend it as a list of representative questions that might be asked when a new Linked Data source is under consideration. \u00a0They are also questions that should inform policy discussion around the uses of Linked Data by consuming libraries and publishers. \u00a0<\/span><\/p>\n<p><span style=\"font-family: Arial; font-size: 15px; vertical-align: baseline;\">That is work that can be formalized and taken further. One intriguing recent development is work toward a <a href=\"http:\/\/www.slideshare.net\/cfuerber\/towards-a-vocabulary-for-data-quality-management-in-semantic-web-architectures\">Data Quality Management Vocabulary<\/a>. \u00a0\u00a0Its stated aims are to <\/span><\/p>\n<ul>\n<li><strong><strong><span style=\"vertical-align: baseline;\">support the expression of quality requirements in the same language, at web scale;<\/span><\/strong><\/strong><\/li>\n<li><strong><strong><span style=\"vertical-align: baseline;\">support the creation of consensual agreements about quality requirements<\/span><\/strong><\/strong><\/li>\n<li><strong><strong><span style=\"vertical-align: baseline;\">increase transparency around quality requirements and measures<\/span><\/strong><\/strong><\/li>\n<li><strong><strong><span style=\"vertical-align: baseline;\">enable checking for consistency <\/span><span style=\"font-style: italic; vertical-align: baseline;\">among<\/span><span style=\"vertical-align: baseline;\"> quality requirements, and<\/span><\/strong><\/strong><\/li>\n<li><strong><strong><span style=\"vertical-align: baseline;\">generally reduce the effort needed for data quality management activities<\/span><\/strong><\/strong><\/li>\n<\/ul>\n<p><span style=\"font-family: Arial; font-size: 15px; vertical-align: baseline;\">The apparatus to be used is a formal representation of \u201cquality-relevant\u201d information. \u00a0\u00a0We imagine that the researchers in this area are looking forward to something like automated <a href=\"http:\/\/www.slideshare.net\/marin_dimitrov\/linked-data-marketplaces\">e-commerce in Linked Data<\/a>, or at least a greater ability to do corpus-level quality assessment at a distance. \u00a0Of course, \u201cfitness-for-use\u201d and other criteria that can really only be seen from the perspective of the user will remain important, and there will be interplay between standardized quality and performance measures (on the one hand) and audience-relevant features on the other. \u00a0\u00a0One is rather reminded of the interplay of technical specifications and \u201ccurb appeal\u201d in choosing a new car. \u00a0That would be an important development in a Semantic Web industry that has not completely settled on what a car is really supposed to be, let alone how to steer or where one might want to go with it.<\/span><\/p>\n<h1><span style=\"font-family: Arial; font-size: large; vertical-align: baseline;\">Conclusion<\/span><\/h1>\n<p><span style=\"font-family: Arial; font-size: 15px; vertical-align: baseline;\">Libraries have always been concerned with quality criteria in their work as a creators of descriptive metadata. \u00a0One of our purposes here has been to show how those criteria will evolve as libraries become publishers of Linked Data, as we believe that they must. That much seems fairly straightforward, and there are many processes and methods by which quality criteria can be embedded in the process of metadata creation and management.<\/span><\/p>\n<p><span style=\"font-family: Arial; font-size: 15px; vertical-align: baseline;\">More difficult, perhaps, is deciding how these criteria can be used to construct policies for Linked Data consumption. \u00a0As we have said many times elsewhere, we believe that there are tremendous advantages and efficiencies that can be realized by linking to data and descriptions created by others, notably in connecting up information about the people and places that are mentioned in legislative information with outside information pools. \u00a0\u00a0That will require care and judgement, and quality criteria such as these will be the basis for those discussions. \u00a0Not all of these criteria have matured &#8212; or ever will mature &#8212; <a href=\"https:\/\/lirias.kuleuven.be\/bitstream\/123456789\/255807\/2\/xavuxavier-pre.pdf\">to the point where hard-and-fast metrics exist<\/a>. \u00a0We are unlikely to ever see rigid checklists or contractual clauses with bullet-pointed performance targets, at least for many of the factors we have discussed here. Some of the new accessibility criteria might be the subject of service-level agreements or other mechanisms used in electronic publishing or database-access contracts. \u00a0But the real use of these criteria is in assessments that will be made long before contracts are negotiated and signed. \u00a0In that setting, these criteria are simply the lenses that help us know quality when we see it.<\/span><\/p>\n<h2 style=\"font-family: Times;\"><span style=\"font-size: 19px; font-family: Arial; vertical-align: baseline;\">References<\/span><\/h2>\n<ul style=\"font-family: Times; margin-top: 0pt; margin-bottom: 0pt;\">\n<li><span style=\"vertical-align: baseline;\">Bruce, Thomas R., and Diane Hillmann (2004). \u201cThe Continuum of Metadata Quality: Defining, Expressing, Exploiting\u201d. In <\/span><span style=\"font-style: italic; vertical-align: baseline;\">Metadata in Practice<\/span><span style=\"vertical-align: baseline;\">, Hillmann and Westbrooks, eds. \u00a0Online at <\/span><a href=\"http:\/\/www.ecommons.cornell.edu\/handle\/1813\/7895\"><span style=\"color: #1155cc; vertical-align: baseline;\">http:\/\/www.ecommons.cornell.edu\/handle\/1813\/7895<\/span><\/a><\/li>\n<li><span style=\"vertical-align: baseline;\">DCMI Metadata Provenance Task Group, at <\/span><a href=\"http:\/\/dublincore.org\/groups\/provenance\/\"><span style=\"color: #1155cc; vertical-align: baseline;\">http:\/\/dublincore.org\/groups\/provenance\/<\/span><\/a><span style=\"vertical-align: baseline;\"> .<\/span><\/li>\n<li><span style=\"vertical-align: baseline;\">Dodds, Leigh (2010) \u201cQuality Indicators for Linked Data Datasets\u201d. Online posting at <\/span><a href=\"http:\/\/answers.semanticweb.com\/questions\/1072\/quality-indicators-for-linked-data-datasets\"><span style=\"color: #1155cc; vertical-align: baseline;\">http:\/\/answers.semanticweb.com\/questions\/1072\/quality-indicators-for-linked-data-datasets<\/span><\/a><span style=\"vertical-align: baseline;\"> .<\/span><\/li>\n<li><span style=\"vertical-align: baseline;\"> Flemming, Annika (2010) \u201cQuality Criteria for Linked Data Sources\u201d. Online at <\/span><a href=\"http:\/\/sourceforge.net\/apps\/mediawiki\/trdf\/index.php?title=Quality_Criteria_for_Linked_Data_sources&amp;action=history\"><span style=\"color: #1155cc; vertical-align: baseline;\">http:\/\/sourceforge.net\/apps\/mediawiki\/trdf\/index.php?title=Quality_Criteria_for_Linked_Data_sources&amp;action=history<\/span><\/a><\/li>\n<li><span style=\"vertical-align: baseline;\">F\u00fcrber, Christian, and Martin Hepp (2011).\u201dTowards a Vocabulary for Data Quality Management in Semantic Web Architectures\u201d. Presentation at the First International Workshop on Linked Web Data Management, Uppsala, Sweden. \u00a0Online at <\/span><a href=\"http:\/\/www.slideshare.net\/cfuerber\/towards-a-vocabulary-for-data-quality-management-in-semantic-web-architectures\"><span style=\"color: #1155cc; vertical-align: baseline;\">http:\/\/www.slideshare.net\/cfuerber\/towards-a-vocabulary-for-data-quality-management-in-semantic-web-architectures<\/span><\/a><span style=\"vertical-align: baseline;\"> .<\/span><\/li>\n<li><span style=\"vertical-align: baseline;\">W3C, \u201cProvenance Vocabulary Mappings\u201d. At <\/span><a href=\"http:\/\/www.w3.org\/2005\/Incubator\/prov\/wiki\/Provenance_Vocabulary_Mappings\"><span style=\"color: #1155cc; vertical-align: baseline;\">http:\/\/www.w3.org\/2005\/Incubator\/prov\/wiki\/Provenance_Vocabulary_Mappings<\/span><\/a><span style=\"vertical-align: baseline;\"> .<\/span><\/li>\n<\/ul>\n<p>Thomas R. Bruce is the Director of the Legal Information Institute at the Cornell Law School.<\/p>\n<p>Diane Hillmann is a principal in <a href=\"http:\/\/managemetadata.com\/\">Metadata Management Associates<\/a>, and a long-time collaborator with the Legal Information Institute.\u00a0 She is currently a member of the Advisory Board for the Dublin Core Metadata Initiative (DCMI), and was co-chair of the DCMI\/RDA Task Group.<\/p>\n<p>VoxPopuLII is edited by <a href=\"http:\/\/www.judithpratt.com\" target=\"_blank\" rel=\"noopener\">Judith Pratt<\/a>. Editors-in-Chief are <a href=\"mailto:Stephnd@illinois.edu\">Stephanie Davidson<\/a> and <a href=\"http:\/\/iinek.net\/\" target=\"_blank\" rel=\"noopener\">Christine Kirchberger<\/a>, to whom queries should be directed.<\/p>\n<!-- AddThis Advanced Settings generic via filter on the_content --><!-- AddThis Share Buttons generic via filter on the_content -->","protected":false},"excerpt":{"rendered":"<p>Van Winkle wakes In this post, we return to a topic we first visited in a book chapter in 2004. \u00a0At that time, one of us (Bruce) was an electronic publisher of Federal court cases and statutes, and the other (Hillmann, herself a former law cataloger) was working with large, aggregated repositories of scientific papers <a href='https:\/\/blog.law.cornell.edu\/voxpop\/2013\/01\/24\/metadata-quality-in-a-linked-data-context\/'>[&#8230;]<\/a><!-- AddThis Advanced Settings generic via filter on get_the_excerpt --><!-- AddThis Share Buttons generic via filter on get_the_excerpt --><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[676],"tags":[],"class_list":["post-3012","post","type-post","status-publish","format-standard","hentry","category-linked-data"],"_links":{"self":[{"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/posts\/3012","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/comments?post=3012"}],"version-history":[{"count":32,"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/posts\/3012\/revisions"}],"predecessor-version":[{"id":4077,"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/posts\/3012\/revisions\/4077"}],"wp:attachment":[{"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/media?parent=3012"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/categories?post=3012"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/tags?post=3012"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}