{"id":1342,"date":"2011-10-02T09:38:12","date_gmt":"2011-10-02T14:38:12","guid":{"rendered":"http:\/\/blog.law.cornell.edu\/voxpop\/?p=1342"},"modified":"2011-10-02T09:38:12","modified_gmt":"2011-10-02T14:38:12","slug":"csl-metadata-and-legal-information-that-just-works","status":"publish","type":"post","link":"https:\/\/blog.law.cornell.edu\/voxpop\/2011\/10\/02\/csl-metadata-and-legal-information-that-just-works\/","title":{"rendered":"CSL, Metadata, and Legal Information that Just Works"},"content":{"rendered":"<p><a href=\"http:\/\/time-az.com\/images\/2009\/06\/20090608img_339663_5807068_0.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignleft\" style=\"margin: 4px\" src=\"http:\/\/time-az.com\/images\/2009\/06\/20090608img_339663_5807068_0.jpg\" alt=\"\" width=\"300\" height=\"225\" \/><\/a><\/p>\n<p style=\"margin-top: 0em\">In the wake of a decisive victory at the Battle of Sekigahara in 1600, Tokugawa Ieyasu treated rival Japanese warlords to a simple but effective instrument of control, pioneered in the preceding Era of the Warring States. The Daimyo, as the defeated clan heads were known, retained control of their respective domains, but were required to reside in the newly established seat of government at Edo (now Tokyo) in alternate years. They were free to return home in the off-years, but only by leaving their princesses and heirs behind in the walled gardens of the capitol, as a token of the enduring bond of friendship and mutual admiration that united the Shogun and his sometimes grudging subordinates.<\/p>\n<p>The processions of competing Daimyo moving to and from the seat of real power soon became a measure of status, and the cost of these semi-annual journeys would eventually consume fully half of each Daimyo\u2019s disposable income. This contributed greatly to the prosperity of communities stationed along the wayside, where tradesmen, innkeepers, chefs, entertainers, and the occasional thief shared in revenue extracted from the peasants in the Daimyo\u2019s fiefdom back home. A cynic might say that the practice of <em>san-kin-k\u014dtai<\/em> (\u53c2\u52e4\u4ea4\u4ee3) was little more than an elaborate system of hostage-taking, but in its way it was very good for business \u2014 at least if you did not have the misfortune to be a peasant.<\/p>\n<p><a href=\"http:\/\/www.flickr.com\/photos\/boojee\/3743753784\/\"><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-1508\" style=\"margin: 4px\" src=\"http:\/\/blog.law.cornell.edu\/voxpop\/files\/2011\/09\/subaru-metadata-smudged.jpg\" alt=\"Original photo by Shira Golding\" width=\"300\" height=\"225\" \/><\/a><\/p>\n<p>Japan later shed the hobbles of feudal regulation, of course, and the population are now free to move about as they please; but for Daimyo read <em>content<\/em>, and for the Daimyo\u2019s princesses and progeny read <em>metadata<\/em>, and you have a description of a familiar Internet business model. Too familiar, perhaps, as most of us now rely on content supplied through <a href=\"http:\/\/ssrn.com\/abstract=635141\">walled gardens<\/a> for much of our research work.<\/p>\n<p>Just as the freedom of individuals is improved by lifting restraints on travel, so the flow of content is more meaningful when accompanied by the descriptive metadata that is its natural companion. As observed by others in this space (most recently <a href=\"http:\/\/blog.law.cornell.edu\/voxpop\/2011\/07\/15\/tear-down-this-paywall\/\">here<\/a> and <a href=\"http:\/\/blog.law.cornell.edu\/voxpop\/2011\/09\/01\/universal-citation-for-state-codes\/\">here<\/a>), there are barriers today to the free flow of legal information. As will be outlined below, hamstrung metadata is, unfortunately, one of them. This information \u2014 mundane details like the date, court, and party names of a legal decision, and the volume, journal, page or identifier used to locate it \u2014 are curiously hard for <em>machines<\/em> to find in the pages issued by any of the leading commercial services in the 40-year-old online legal information industry.<\/p>\n<p>More than any fundamental difference in the materials themselves, captive metadata accounts for the striking gap that has emerged between the research tools available in law and in other disciplines. Driven by the needs of researchers in the sciences and the humanities, personal research platforms that thrive on metadata are now widely available: to make them servants of the law, they want only to be fed.<\/p>\n<p>One element of this alternative infrastructure that depends on rich metadata provision is the Citation Style Language (<a href=\"http:\/\/citationstyles.org\/\">CSL<\/a>), which is the proper subject of this essay. The next three sections provide a short introduction to CSL, followed by a few observations on the state of legal metadata provision on today&#8217;s legal Internet. The essay concludes with a comment on some of the lights that seem to be flickering into view at the end of this particular tunnel, and on the prospective benefits of at last bringing the law within reach of a modern research support ecosystem.<\/p>\n<h1>About CSL<\/h1>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignright\" src=\"http:\/\/citationstylist.org\/wp-content\/uploads\/csl+citeproc-js-250.png\" alt=\"\" width=\"250\" height=\"386\" \/><\/p>\n<p>The Citation Style Language is an XML vocabulary for accurately describing citation and bibliography formats. Given the <a href=\"http:\/\/web.archive.org\/web\/20100213183221\/http:\/\/community.muohio.edu\/blogs\/darcusb\/archives\/2006\/07\/29\/csl-progress\">breath of life<\/a> by the original <a href=\"http:\/\/www.zotero.org\/\">Zotero<\/a> citation formatter, CSL is now entering its eighth year of development, can boast two full production implementations, and drives citation formatting in at least six major bibliographic or text processing projects, with total user numbers in the millions.<\/p>\n<p>The illustration to the right provides a simplified view of CSL processing flow. In greater detail it works like this:<\/p>\n<ol>\n<li>A running copy of the processor is cast (&#8220;instantiated&#8221;) using the rules specified in a particular <a href=\"http:\/\/www.zotero.org\/styles\/chicago-author-date\">CSL style file<\/a>.<\/li>\n<li>The calling application sends fine-grained item metadata to the processor.<\/li>\n<li>The processor registers data it receives, for the purpose of tracking the document context of each item.<\/li>\n<li>The calling application sends a request for a citation or a bibliography listing. In the former case, the call will supply information about document state (note numbers and the like), and additional details specific to the request (such as a pinpoint page number).<\/li>\n<li>The processor analyses the request, calculates any auto-generated item variables, and applies any disambiguation rules defined in the style to assure that item references are unique.<\/li>\n<li>The processor returns the citation or bibliography listing as a serialized string in the language (such as English or French) and the markup format (such as XHTML or RTF) that it has been configured to deliver.<\/li>\n<\/ol>\n<p>The upshot of all this swirling machinery is that <em>generic metadata<\/em> can be used to generate citations in <em>arbitrary formats<\/em>. In operation, this means that an article originally written according to, say, the Oxford Standard for Citation of Legal Authorities (<a href=\"http:\/\/www.law.ox.ac.uk\/publications\/oscola.php\">OSCOLA<\/a>) can be reformatted on the fly to conform to the requirements of, say, the <a href=\"http:\/\/lawjournal.mcgill.ca\/citeguide.php\">McGill Guide<\/a>, or perhaps the <a href=\"http:\/\/mulr.law.unimelb.edu.au\/go\/AGLC3\">Australian Guide to Legal Citation<\/a> (PDF) or the <a href=\"http:\/\/www.alwd.org\/publications\/citation_manual.html\">ALWD Manual<\/a>. This functionality is used daily by researchers in most fields worldwide, and there is no reason the law should be an exception.<\/p>\n<p>The automated generation of citations is just one benefit of this processing flow; it also enables the embedding of cited metadata directly in the source document (for sharing between collaborators), and it allows links to referenced resources to be attached at the point of production (for ease of referencing after publication). Hints of resistance <a href=\"https:\/\/github.com\/citation-style-language\/styles\/issues\/53\">from some quarters<\/a> notwithstanding, such tools clearly promise to save law professors, law students, lawyers, court clerks, judges, and others who must do legal drafting a tremendous amount of time.<\/p>\n<h1>Formatting citations<\/h1>\n<p>There are a few commonly-encountered wrinkles in legal data and citation styles that CSL and the <span style=\"font-family: mono\">citeproc-js<\/span> formatter have been carefully designed to address. To give readers a glimpse of this work, a few basic elements of the language are laid out below. We&#8217;ll begin with the following sample citation in the OSCOLA style: <em>Jones &amp; others v Wright<\/em> [1991] 3 All ER 88.<\/p>\n<p>The bare case name can be produced with the following construct:<\/p>\n<pre><span style=\"background: #c9e7c4\">&lt;text variable=\"title\" font-style=\"italic\" strip-periods=\"true\"\/&gt;<\/span><\/pre>\n<p>(Note the use of <span style=\"font-family: mono\">font-style=&#8221;italic&#8221;<\/span> to render the variable content in italic type, and of the <span style=\"font-family: mono\">strip-periods=&#8221;true&#8221;<\/span> attribute, which will be discussed below.)<\/p>\n<p>The year element can be produced with the following code:<\/p>\n<pre><span style=\"background: #c9e7c4\">&lt;date variable=\"issued\" form=\"text\" date-parts=\"year\" prefix=\"[\" suffix=\"]\"\/&gt;<\/span><\/pre>\n<p>(Note the use of <span style=\"font-family: mono\">prefix<\/span> and <span style=\"font-family: mono\">suffix<\/span>.)<\/p>\n<p>To build the full cite, we join these and other elements together by wrapping them in a <span style=\"font-family: mono\">group<\/span> element and setting a single space as the delimiter. In the example below, we also define this construct as a macro, so that it can easily be reused in multiple contexts in the style:<\/p>\n<pre><span style=\"background: #c9e7c4\">&lt;macro name=\"oscola-case\"&gt;<\/span>\r\n    <span style=\"background: #c9e7c4\">&lt;group delimiter=\" \"&gt;<\/span>\r\n        &lt;text variable=\"title\" font-style=\"italic\" strip-periods=\"true\"\/&gt;\r\n        &lt;date variable=\"issued\"  form=\"text\" date-parts=\"year\"\r\n              prefix=\"[\" suffix=\"]\"\/&gt;\r\n        <span style=\"background: #c9e7c4\">&lt;number variable=\"issue\"\/&gt;<\/span>\r\n        <span style=\"background: #c9e7c4\">&lt;text variable=\"container-title\"\/&gt;<\/span>\r\n        <span style=\"background: #c9e7c4\">&lt;text variable=\"page-first\"\/&gt;<\/span>\r\n    <span style=\"background: #c9e7c4\">&lt;\/group&gt;<\/span>\r\n<span style=\"background: #c9e7c4\">&lt;\/macro&gt;<\/span><\/pre>\n<p>If we want to use this cite form for English legal cases only, we can wrap it in a condition:<\/p>\n<pre><span style=\"background: #c9e7c4\">&lt;choose&gt;<\/span>\r\n    <span style=\"background: #c9e7c4\">&lt;if type=\"legal_case\" jurisdiction=\"gb\" match=\"all\"&gt;<\/span>\r\n        &lt;text macro=\"oscola-case\"\/&gt;\r\n    <span style=\"background: #c9e7c4\">&lt;\/if&gt;<\/span>\r\n<span style=\"background: #c9e7c4\">&lt;\/choose&gt;<\/span><\/pre>\n<p>(Note the <span style=\"font-family: mono\">type<\/span>, <span style=\"font-family: mono\">jurisdiction<\/span> and <span style=\"font-family: mono\">match<\/span> attributes, and the use of a <span style=\"font-family: mono\">text<\/span> node with a <span style=\"font-family: mono\">macro<\/span> attribute to call our macro.)<\/p>\n<p>With the code above, we will obtain something close to our target cite format if we arrange for the calling application to feed the processor JSON input like the following:<\/p>\n<pre>{\r\n    \"container-title\": \"All England Law Reports\",\r\n    \"date\": {\r\n        \"date-parts\": [[\"1991\"]]\r\n    },\r\n    \"issue\": \"3\",\r\n    \"page\": \"88\",\r\n    \"title\": \"Jones &amp; others v. Wright\"\r\n}<\/pre>\n<p>Looking carefully at this input, we can see that there are some small discrepancies in the metadata:<\/p>\n<ul>\n<li>the period after the <span style=\"font-family: mono\">v<\/span>; and<\/li>\n<li>the full name of the reporter.<\/li>\n<\/ul>\n<p>These details can be handled automatically in the processor. The first issue is trivial: quashing periods is a general requirement of OSCOLA, and this one will be removed by the <span style=\"font-family: mono\">strip-periods=&#8221;true&#8221;<\/span> attribute that we set on the title element. The second issue requires a bit of further explanation.<\/p>\n<h1>Applying abbreviations<\/h1>\n<p>In our sample input, the journal name has been spelled out in full to avoid ambiguity. This is an example of best practice, although the field content does differ from our desired output of &#8220;All\u00a0ER&#8221;. The current version of Zotero provides a <span style=\"font-family: mono\">journalAbbreviation<\/span> field for each item, but this has known limitations, and is not suitable for legal writing.<\/p>\n<p>Many styles require that commonly cited journal names, at least, be abbreviated. Some styles have mandatory and idiosyncratic abbreviation requirements. As Judge Posner <a href=\"http:\/\/www.yalelawjournal.org\/images\/pdfs\/940.pdf\">commented recently<\/a> (PDF) concerning the requirements of <span style=\"font-variant: small-caps\">Bluebook: A Uniform System of Citation<\/span>: <em>It\u2019s as if there were a heavy tax  on letters, making it costly to write out Coast Guard Court of Criminal Appeals instead of abbreviating it &#8230;<\/em> There is no tax on letters, of course, but the lack of a truly uniform system of abbreviation means that such elaborate schemes impose a significant cost in their own right. In Zotero, if journal abbreviations are registered directly on individual items in the user&#8217;s personal library, they must be entered manually for each item,  both when the original item is created, and each time the user wants to generate citations in a different style. This is not acceptable: <em>metadata should be generic<\/em>.<\/p>\n<p>With a view to squaring the needs of users with those of the more demanding styles, the <span style=\"font-family: mono\">citeproc-js<\/span> processor allows arbitrary abbreviation lists to be registered and managed on a per-style basis.<\/p>\n<p>Here&#8217;s how it works. When the processor encounters a field that requests <span style=\"font-family: mono\">form=&#8221;short&#8221;<\/span>, it looks for the field content in an externally-supplied abbreviation list derived from a small (persistent) database. If there is no match, the processor opens an empty entry for the field in its (ephemeral) run-time registry. In an application that draws on this functionality, the user can visit the run-time listing at any time, and enter suitable abbreviations. These are then registered in the persistent external database, where they are remembered for future use with the current style.<\/p>\n<p>In our case, the user would enter &#8220;All\u00a0ER&#8221; as the journal abbreviation, and the application would store and deliver auxiliary input like the following:<\/p>\n<pre>{\r\n    \"container-title\": {\r\n        \"All England Law Reports\": \"All ER\"\r\n    }\r\n}<\/pre>\n<p>Abbreviations list support has not yet been implemented in mainstream projects, but I have built <a href=\"http:\/\/citationstylist.org\/tools\/?#abbreviations-gadget-entry\">a small Firefox add-on<\/a> for use with Zotero that draws upon it, and I am happy to report that it <a href=\"http:\/\/citationstylist.org\/2011\/09\/06\/abbreviations-gadget-available\/\">does work<\/a>, <a href=\"http:\/\/citationstylist.org\/2011\/09\/19\/hereinafter-support-for-note-styles\/\">as advertised<\/a>, both for journal abbreviations, and for other similar purposes (such as &#8220;hereinafter&#8221; support).<\/p>\n<p>In our CSL code, invoking the abbreviation list machinery requires only a small change to the citation macro:<\/p>\n<pre>&lt;macro name=\"oscola-case\"&gt;\r\n    &lt;group delimiter=\" \"&gt;\r\n        &lt;text variable=\"title\" font-style=\"italic\" strip-periods=\"true\"\/&gt;\r\n        &lt;date variable=\"issued\"  form=\"text\" date-parts=\"year\"\r\n              prefix=\"[\" suffix=\"]\"\/&gt;\r\n        &lt;number variable=\"issue\"\/&gt;\r\n        &lt;text variable=\"container-title\" <span style=\"background: #c9e7c4\">form=\"short\"<\/span>\/&gt;\r\n        &lt;text variable=\"page-first\"\/&gt;\r\n    &lt;\/group&gt;\r\n&lt;\/macro&gt;<\/pre>\n<p>A full style will be more elaborate, but the basic logical structures are the same, with conditional statements used to select simple nested groups of nodes that describe the output to be produced.<\/p>\n<hr \/>\n<p><a href=\"http:\/\/www.flickr.com\/photos\/andrew_bolin\/3832552009\/sizes\/z\/in\/photostream\/\"><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-1622\" style=\"margin: 4px\" src=\"http:\/\/blog.law.cornell.edu\/voxpop\/files\/2011\/09\/channel-change-1-150.jpg\" alt=\"\" width=\"150\" height=\"197\" \/><\/a>I&#8217;ll draw a line under the technical discussion at this point, but you get the idea.<\/p>\n<p>CSL is an elegant and expressive language that has grown under the tutelage of strict demands from academics and graduate students in many fields. The language is fully documented in the <a href=\"http:\/\/citationstyles.org\/downloads\/specification.html\">CSL Specification<\/a>. The proposed extensions for full legal support, documented in the <a href=\"http:\/\/gsl-nagoya-u.net\/http\/pub\/citeproc-js-csl.html\"><span style=\"font-family: mono\">citeproc-js<\/span> CSL Specification Supplement<\/a>, have been carefully formulated, and I am open to feedback. Style development is proceeding apace, and increments and milestones are being reported through the <a href=\"http:\/\/citationstylist.org\/\">CitationStylist.org<\/a> website, which serves as a clearinghouse for legal and multilingual style development. From experience with the first target style for full implementation (the Creative Commons licensed <a href=\"http:\/\/www.law.ox.ac.uk\/publications\/oscola.php\">OSCOLA<\/a>), the prospects for CSL style support for legal resources that &#8220;disappears&#8221;, as such tools ought to do, are very bright.<\/p>\n<h1>Input from the Web<\/h1>\n<p><a href=\"http:\/\/www.flickr.com\/photos\/mwichary\/4359237861\/\"><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-1504\" style=\"margin: 4px\" src=\"http:\/\/blog.law.cornell.edu\/voxpop\/files\/2011\/09\/typing-ball.jpg\" alt=\"Photo by Marcin Wichary\" width=\"300\" height=\"227\" \/><\/a>In addition to bringing us open-source community-driven citation formatting technology, Zotero offers one-click acquisition of content, to a full-featured personal electronic library on the user&#8217;s desktop. This is handy, even essential, in today&#8217;s world of overabundant information sources. It is facilitated by the fact that in most fields of study, aggregator sites have a long history of providing access to structured metadata from their pages.<\/p>\n<p>The server-side technology that enables one-click content acquisition well predates the Internet. Libraries that run their catalogs on the 1980&#8217;s <a href=\"http:\/\/www.loc.gov\/marc\/\">MARC standard<\/a> or one of its variants can and often do expose these records to the Internet. Aggregators in the sciences typically provide <a href=\"http:\/\/texlipse.sourceforge.net\/manual\/bibtex.html\">BibTeX<\/a> records, which researchers have relied upon since the original format was frozen in 1988. Booksellers and publishing consortia offer metadata keyed to <a href=\"http:\/\/www.isbn.org\/standards\/home\/about\/index.html\">ISBN numbers<\/a>, and the publishers of academic and other journals participate in <a href=\"http:\/\/ssrn.com\/abstract=1577074\">the DOI system<\/a> for assigning unique IDs keyed to canonical metadata for individual articles. The world of academic discourse is swimming in rich, life-giving metadata. Until, that is, one arrives on the salted shores of the law, where there is no water, and precious little sand.<\/p>\n<p><a href=\"http:\/\/www.flickr.com\/photos\/visitingeu\/4324944457\/sizes\/m\/in\/photostream\/\"><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-1626\" style=\"margin: 4px\" src=\"http:\/\/blog.law.cornell.edu\/voxpop\/files\/2011\/09\/stonehenge.jpg\" alt=\"A Pile of Rocks\" width=\"300\" height=\"225\" \/><\/a>The metadata story on the paywalled sites is very straightforward: exposing it would not be in the vendor&#8217;s commercial interest, so there isn&#8217;t any. It&#8217;s hard to fault the logic. Even if we insist on the unflattering feudal analogy with which this essay opened, it&#8217;s worth remembering that Japan&#8217;s Shogunate endured for 250 years before finally giving way to change. Business opportunities don&#8217;t come much better than that, and one can hardly expect the leading providers to react any differently.<\/p>\n<p>There is variety in the ecosystem, however, and not all suppliers of legal source are driven by the same pattern of economic incentives. Providers that expose their content with metadata stand to benefit from CSL and other infrastructure-in-waiting, which can significantly raise the real value of their service. To state the point more precisely: supplying <em>fine-grained<\/em> metadata is essential for a publisher\u2019s content to be attractive to third-party reference management tools like Zotero \u2014 it\u2019s important enough to be in the project\u2019s <a href=\"http:\/\/www.zotero.org\/support\/dev\/exposing_metadata\">guidance notes<\/a>.<\/p>\n<p>This is a separate point from the movement for <a href=\"http:\/\/universalcitation.org\/\">universal or format-neutral citation formats<\/a>. Promotion of these is also important, but from the perspective of data acquisition, they are not sufficiently uniform <em>across jurisdictions<\/em> to serve, by themselves, as primary metadata for a general research platform. As a well-intended example, consider this tag embedded in a case from <a href=\"http:\/\/www.canlii.org\/\">CanLII<\/a>:<\/p>\n<pre>&lt;meta name=\"DC.Title\"\r\n      content=\"Smith v. Jones, 2003 CanLII 19166 (NWT RO)\"\/&gt;<\/pre>\n<p>In order to register this item in a reference manager database, we need to know what each of the elements <em>means<\/em>. This will be obvious to a local practitioner, but a Zotero page translator would need to include hand-crafted pattern-matching functions to parse out the elements and assign them to field variables. If I were doing the coding (ignorant as I am of Canadian law), I would be stumped by several of these elements:<\/p>\n<dl>\n<dt>19166<\/dt>\n<dd>From the size of the number, I guess that it is a document identification number, but if it were smaller, I might mistake it for a page number. That could result in my misclassifying the cite as one to a printed reporter, and that in turn could affect the formatting of pinpoint identifiers in styled output generated from the harvested data.<\/dd>\n<dt>NWT<\/dt>\n<dd>This appears to be a geographic identifier (Northwest Territories?), but I am not sure. I am also unsure whether such identifiers <em>always<\/em> appear in citations; whether or not they might include spaces, numbers, or other characters; and what the full set of possible identifiers looks like.<\/dd>\n<dt>RO<\/dt>\n<dd>This one has me completely stumped, so I would be mailing friends who might know something about Canadian law.<\/dd>\n<\/dl>\n<p>The answers would be obvious to a Canadian lawyer, of course, and with a bit of effort I could look up the details. But multiplied across the jurisdictions of the world, that is an effort that would prove fatal to the task. A <span style=\"font-family: mono\">meta<\/span> tag containing a full formatted citation is better than nothing, but with fine-grained metadata and simple descriptive variable names for each of the elements, the code would practically write itself. It really does make all the difference.<\/p>\n<p><a href=\"http:\/\/www.flickr.com\/photos\/michaelreuter\/4438541418\/\"><img loading=\"lazy\" decoding=\"async\" class=\"alignleft\" style=\"margin: 4px\" src=\"http:\/\/farm3.static.flickr.com\/2799\/4438541418_9ca513985f.jpg\" alt=\"\" width=\"300\" height=\"215\" \/><\/a><\/p>\n<p style=\"margin-top: -1em\">A further issue concerns parallel references, which I mention here for the sake of completeness in ranting. In a world that offers an <a href=\"http:\/\/developers.facebook.com\/docs\/reference\/api\/application\/\">API<\/a> to the entire <a href=\"https:\/\/graph.facebook.com\/102452128776\">fictional economy of Farmville<\/a>, one would think that the various and sundry parallel citations to, say, <a href=\"http:\/\/scholar.google.com\/scholar?q=quackenbush+v+us&amp;hl=en&amp;btnG=Search&amp;as_sdt=2%2C5\">Quackenbush v. US<\/a> would be available as a simple machine-readable graph. But as we have seen, the leading paywalled providers don&#8217;t even supply the <em>date of the decision<\/em> in structured form, let alone parallel citation mappings: the data they publish is basically useless for this purpose.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignright size-full wp-image-1523\" style=\"margin-top: 2em;clear: left\" src=\"http:\/\/blog.law.cornell.edu\/voxpop\/files\/2011\/09\/quackenbush-refactored.png\" alt=\"\" width=\"400\" height=\"101\" \/>The least-painful path at present is to visit <a href=\"http:\/\/scholar.google.com\/\">Google Scholar<\/a> with <a href=\"http:\/\/www.zotero.org\/\">Zotero<\/a>, and fetch the case from the hit listing (not from the case page itself). This yields a set of three cross-linked items that reflect the parallel reports of the case. One click and you&#8217;re away \u2014 but consider what happens behind the scenes: (1)\u00a0the Zotero translator performs <a href=\"https:\/\/github.com\/zotero\/translators\/blob\/master\/Google%20Scholar.js#L91\" target=\"_blank\">contorted screen-scraping<\/a> of (2)\u00a0the <em>displayed citations<\/em> in the Google listing, which (3)\u00a0were in turn reverse-engineered from scanned source, and hence (4)\u00a0cannot be trusted for 100% accuracy. It is a testament to human ingenuity that this is possible at all, but the underlying infrastructure is an embarrassing bundle of wet string.<\/p>\n<p>Parallel references are tracked internally, of course, by the major service providers. Lack of user-side access to these mappings has the side effect (bizarre, from the standpoint of other fields) of placing uncommon importance on human-readable citations, because they are the only available means of identifying a given case across multiple data silos. Given current publishing arrangements, the problem is intractable, and for the present the best we can do on the reference management side is to provide means of recording these relations in personal libraries when they are identified by individual users.<\/p>\n<h1>In lieu of concluding<\/h1>\n<p><a href=\"http:\/\/www.geograph.org.uk\/photo\/745306\"><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-1665\" style=\"margin: 4px\" src=\"http:\/\/blog.law.cornell.edu\/voxpop\/files\/2011\/09\/buds-1.jpg\" alt=\"\" width=\"300\" height=\"234\" \/><\/a>To end on a positive note, compliments are due to the growing number of publishers and dissemination initiatives that have gone the distance to expose well-structured metadata. In the <a href=\"http:\/\/citationstyles.org\/\">CitationStyles.org<\/a> project, my own immediate aim is to get the CSL output story into shape, and I confess that I have not followed recent (and some not-so-recent) developments as closely as I should. As styles firm up and field assignment conventions come to be settled, I&#8217;ll be looking forward to work (by others, as well as a bit myself) on serving the growing number of open-access legal publishers that provide structured metadata.<\/p>\n<p>Zotero is a flexible feeder, and the specific format in which metadata is presented is less important than that it be separated into discrete fields. The <span style=\"font-family: mono\">meta<\/span> <a href=\"http:\/\/www.law.cornell.edu\/supct\/html\/08-1448.ZS.html\">field assignments<\/a> in <a href=\"http:\/\/www.law.cornell.edu\/\">the Cornell LII<\/a> Supreme Court judgments (CASENAME, DOCKET, DECDATE) serve the purpose. The <a href=\"http:\/\/scholar.google.com\/scholar.bib?q=info:VNElXd6SeYgJ:scholar.google.com\/&amp;output=citation&amp;hl=en&amp;as_sdt=2,5&amp;ct=citation&amp;cd=0\">BibTeX<\/a> source served by Google Scholar works as well. The legislative metadata at <a href=\"http:\/\/www.legislation.gov.uk\/uksi\/2011\/2305\/contents\/made\">legislation.gov.uk<\/a> also works. The <a href=\"https:\/\/law.resource.org\/pub\/reporter\/YesWeScan\/0044\/0044.f.0001.html\">microformats metadata<\/a> embedded in Federal cases on <a href=\"http:\/\/law.resource.org\/\">law.resource.org<\/a> gives us enough to work with, and the very complete details in the <a href=\"https:\/\/law.resource.org\/pub\/\">RECOP<\/a> material are quite useful when they are carried through in refactored pages (as <a href=\"http:\/\/www.freelawreporter.org\/flrdoc.php?uuid=8a47aee8-6b83-52ff-ac84-a68d17521ce5\">they are<\/a> in <a href=\"http:\/\/blog.law.cornell.edu\/voxpop\/2011\/05\/25\/the-free-law-reporter-open-access-to-the-law-and-beyond\/\">the Free Law Reporter<\/a> served by <a href=\"http:\/\/www.cali.org\/\">CALI<\/a>).<\/p>\n<p>One of the benefits to be anticipated, as we make our way toward improved interoperation between publishers and third-party reference management tools for law, is a reduction in the barriers to collaboration between law and other disciplines.  Legal citation conventions are by nature quite demanding, and removing some of their sting will improve access not only to the law itself, but also to participation in its discourse.<\/p>\n<p>All signs of rain, and very welcome for grassroots projects like CSL.<\/p>\n<hr \/>\n<p><a href=\"http:\/\/gsl-nagoya-u.net\/faculty\/cache\/gsliF_Bennett.html\"><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-1675\" style=\"margin: 4px\" src=\"http:\/\/blog.law.cornell.edu\/voxpop\/files\/2011\/09\/bennett-small.jpg\" alt=\"\" width=\"150\" height=\"200\" srcset=\"https:\/\/blog.law.cornell.edu\/voxpop\/files\/2011\/09\/bennett-small.jpg 285w, https:\/\/blog.law.cornell.edu\/voxpop\/files\/2011\/09\/bennett-small-225x300.jpg 225w\" sizes=\"auto, (max-width: 150px) 100vw, 150px\" \/><\/a><a href=\"http:\/\/gsl-nagoya-u.net\/faculty\/cache\/gsliF_Bennett.html\"><strong>Frank Bennett<\/strong><\/a> is Associate Professor in <a href=\"http:\/\/gsl-nagoya-u.net\/\">the Graduate School of Law at Nagoya University<\/a>. His active projects related to legal informatics include the <a href=\"https:\/\/bitbucket.org\/fbennett\/citeproc-js\/wiki\/Home\">citeproc-js<\/a> CSL processor, an experimental multilingual branch of the <a href=\"http:\/\/www.zotero.org\/\">Zotero<\/a> reference manager (<a href=\"http:\/\/www.zotero.org\/blog\/new-release-multilingual-zotero-with-duplicates-detection\/\">MLZ<\/a>), and the <a href=\"http:\/\/citationstylist.org\/\">CitationStylist.org<\/a> initiative for creating a CSL family of legal styles.<\/p>\n<p>The CSL language was <a href=\"http:\/\/web.archive.org\/web\/20100211211113\/http:\/\/community.muohio.edu\/blogs\/darcusb\/archives\/2004\/08\/13\/processing-citations\">originally conceived<\/a> by <a href=\"http:\/\/www.users.muohio.edu\/darcusb\/index.html\">Bruce D\u2019Arcus<\/a>. The CSL 1.0 schema and specification are maintained by <a href=\"http:\/\/www.users.muohio.edu\/darcusb\/index.html\">Bruce D\u2019Arcus<\/a> and <a href=\"http:\/\/www.linkedin.com\/in\/rintzezelle\">Rintze Zelle<\/a>.<\/p>\n<p>(Readers should kindly note that despite Frank&#8217;s tasteful choice of hat in the photo to the left, the views expressed in this post are his own, and do not necessarily reflect those of Cornell University or the Cornell Legal Information Institute.)<\/p>\n<p>VoxPopuLII is edited by <a href=\"http:\/\/www.judithpratt.com\/\">Judith Pratt.<\/a> Editor-in-Chief is <a href=\"http:\/\/legalinformatics.wordpress.com\/about\/\">Robert Richards<\/a>,   to whom queries should be directed. The statements above are not legal   advice or legal representation. If you require legal advice, consult a   lawyer. <a href=\"http:\/\/lawyers.law.cornell.edu\/\">Find a lawyer<\/a> in <a href=\"http:\/\/lawyers.law.cornell.edu\/\">the Cornell LII Lawyer Directory<\/a>.<\/p>\n<!-- AddThis Advanced Settings generic via filter on the_content --><!-- AddThis Share Buttons generic via filter on the_content -->","protected":false},"excerpt":{"rendered":"<p>In the wake of a decisive victory at the Battle of Sekigahara in 1600, Tokugawa Ieyasu treated rival Japanese warlords to a simple but effective instrument of control, pioneered in the preceding Era of the Warring States. The Daimyo, as the defeated clan heads were known, retained control of their respective domains, but were required <a href='https:\/\/blog.law.cornell.edu\/voxpop\/2011\/10\/02\/csl-metadata-and-legal-information-that-just-works\/'>[&#8230;]<\/a><!-- AddThis Advanced Settings generic via filter on get_the_excerpt --><!-- AddThis Share Buttons generic via filter on get_the_excerpt --><\/p>\n","protected":false},"author":14,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[328,327,330,329],"tags":[4836,4835,4827,4830,4829,4828,4837,4832,4833,4831,4834,4826],"class_list":["post-1342","post","type-post","status-publish","format-standard","hentry","category-legal-citation","category-legal-citations","category-legal-descriptive-metadata","category-legal-metadata","tag-bluebook","tag-citation-of-legal-authorities","tag-citation-style-language","tag-citationstylist-org","tag-citeproc-js","tag-frank-bennett","tag-legal-bluebook","tag-legal-citation-management-software","tag-legal-citation-management-systems","tag-legal-citation-software","tag-public-access-to-legal-metadata","tag-zotero"],"_links":{"self":[{"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/posts\/1342","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/users\/14"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/comments?post=1342"}],"version-history":[{"count":491,"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/posts\/1342\/revisions"}],"predecessor-version":[{"id":1909,"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/posts\/1342\/revisions\/1909"}],"wp:attachment":[{"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/media?parent=1342"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/categories?post=1342"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/tags?post=1342"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}