{"id":2756,"date":"2012-09-17T11:59:47","date_gmt":"2012-09-17T16:59:47","guid":{"rendered":"http:\/\/blog.law.cornell.edu\/voxpop\/?p=2756"},"modified":"2012-09-17T11:59:47","modified_gmt":"2012-09-17T16:59:47","slug":"standardizing-the-worlds-legislative-information-one-hackathon-at-a-time","status":"publish","type":"post","link":"https:\/\/blog.law.cornell.edu\/voxpop\/2012\/09\/17\/standardizing-the-worlds-legislative-information-one-hackathon-at-a-time\/","title":{"rendered":"Standardizing the World\u2019s Legislative Information\u2014One hackathon at a time"},"content":{"rendered":"

As guest bloggers to this site, we have been asked to write about big ideas. \u00a0We\u2019ll get to those. But first, a note about hackathons.<\/p>\n

\"Hackathon

Could legal hackathons be like this one day?<\/p><\/div>\n

Hackathons used to be the exclusive domain of soda-and-coffee-guzzling, pizza-eating, all-night hacking, highly competitive computer programmers. The result of such a hackathon is often supposed to be a cool app (like the forerunner of Twitter) that is even cooler because it was built in the compressed schedule of the event. More recently, hackathons have been popping up in a variety of places, with some unexpected contexts and sponsors including the U.S. House of Representatives<\/a>, NASA<\/a>, Brooklyn Law School<\/a>, New York City government<\/a>, and others. These events serve as a way to prove (or build) the sponsor\u2019s tech credentials and to cross-fertilize policy and technology expertise.\u00a0 There has been some handwringing and thoughtful commentary<\/a> about the expansion of \u201ccivic\u201d hackathons and what sustainable outcomes they produce.<\/p>\n

As co-organizers, with Karen Suhaka<\/a>, Greg Wilson, Charles Belle<\/a> and others, of two legislative focused, hackathon-inspired events–the California Law Hackathon<\/a>, and the International Legislation Un-hackathon<\/a>–we can attest to their value in bringing engineers and lawyer and policy folks together. We can give some insights into the kinds of benefits these events have had in propelling efforts on legislative data standards, and some of the advances that have taken place in the development of these standards over the last year.<\/p>\n

Big Idea: Legislative Data Standards<\/h2>\n

And now to the big idea: to represent all the world\u2019s legislation in a standard structured data format. That\u2019s actually two big ideas: (1) putting legislation into a structured data format, and (2) designing that format so that it is compatible with the wide variety of laws and legislative document types worldwide.<\/p>\n

There are reasons for doing these things: First, introducing structured data to legislation can make it possible to search and analyze the law with greater precision and efficiency. And second, having a common standard can permit more comprehensive bill-tracking and comparison between jurisdictions.<\/p>\n

\"California<\/a>

California Bill with Metadata<\/p><\/div>\n

It also can make it possible for legislatures with small (and shrinking) budgets to benefit from some of the same bill drafting software that is being developed for much larger jurisdictions. (Full disclosure: Xcential has developed such software for more than ten years, including the drafting platform used by the State of California.)<\/p>\n

In the age of Google, these ideas may not seem so big; in fact, they are a subset of Google\u2019s far-reaching mission<\/a>. However, legislation is a corner of the world\u2019s information that Google has not yet addressed in a systematic way. And as regular readers of this blog know, legislation presents its own hurdles, technical and bureaucratic (not necessarily in that order), that make this both an interesting and a challenging problem. One of the challenges is that the kind of people who generally work with data (we\u2019ll call them engineers) and the kind of people who generally work with legislation (we\u2019ll call them lawyers and policy folks) don\u2019t often work on data and legislation together.\u00a0 One of us, a lawyer and policy type, has made this point graphically<\/a> (and somewhat hyperbolically) in a Quora response to a question about whether version control software could be used for legislation. That question, and a subsequent discussion<\/a> generated in response to a blogpost<\/a> by software engineer Abe Voelker about version control for legislation, drew in many engineers and some lawyers and policy folks.<\/p>\n

For software engineers who consider such things, it is very attractive to think about treating legal text as if it were software code; we could automatically highlight and cross-validate key terms, run test cases, automate redlining and version control, etc. It would be easy to see what the state of the law was at any particular point in time, and to trace the series of amendments that got us into the mess we\u2019re in today. This desire is often expressed as \u201cWhat if we had a Github<\/a> for legislation?\u201d On the other hand, people who work closely with legislation–researching it, drafting it or developing information systems to deal with it–tend to see the many places that the analogy between computer code and legal code break down. Legal texts have been shaped over hundreds of years by technologically conservative institutions, using print-based systems.<\/p>\n

The full transformation of law to digital information is not going to happen overnight. While most law is already accessible in electronic format (often pdf), it is not encoded in a way that software engineers could start using their favorite text-munching tools. One of us, an engineer, has described this<\/a> as the difference between computerization and automation. The move toward better digital tools for automating legislative drafting and research tasks will require more dialogue and working exchanges between engineers and the lawyers and policy folks.<\/p>\n

That brings us back to hackathons.<\/p>\n

What is a Legislative Hackathon?<\/h2>\n

Recognizing the need to bring lawyers and engineers together in order to implement our big idea(s)<\/strong>, and appreciating the valuable bandwagon that hackathons have become, we decided to jump onboard. The first event we organized, the California Law Hackathon<\/a>, was hosted just over a year ago, in September 2011, in Berkeley at the offices of Maplight<\/a>, and in Denver by Karen Suhaka\u2019s team at BillTrack50<\/a>. The event focused on building web-based visualization tools to track the timeline of amendments to California legislation, and to link particular amendments, through their legislative sponsors, to particular donors or interest groups.\u00a0 We were joined remotely by a number of international participants, including John Sheridan<\/a>, head of e-services for legislation.gov.uk, and a fellow guest contributor to this blog. As one participant noted<\/a>, we learned a great deal at the event, including the limits placed on us by the existing data. Neither the legislative record, nor the donations databases are detailed enough to trace influence in politics in the way we hoped. This helped spark an interest in a more in-depth exploration of legislative data formats, and in particular how more and better metadata could be added to legislation.<\/p>\n

That led to the International Legislation Un-hackathon, held simultaneously at UC Hastings, Stanford and Denver, with participants from the University of Bologna (Ravenna campus) and around the world. So assuming you can get engineers together with lawyers and policy folks, what do you do with them?\u00a0 We decided that we\u2019d need a user-friendly tool that could be used to explore and add metadata to legislation from around the world.\u00a0 This could highlight a developing legislative XML standard, Akoma Ntoso (more about this standard soon), and give hands-on experience to lawyer and policy types kinds of text and analysis tools that engineers take for granted.<\/p>\n

Hacking With A Legislative Editor<\/h2>\n

So one of us (the engineer, naturally) started building a web-based editor for legislation<\/a>, while the other (the lawyer, naturally) started organizing the next hackathon. Of course, thought the lawyer, it would just<\/p>\n

\"Web-based<\/a>

Legislative Editor at legalhacks.org<\/p><\/div>\n

be a matter of time before all governments worldwide use such editors to draft their laws and regulations in a standard data format.<\/p>\n

Advances in Legislative Data Standards Efforts<\/em><\/p>\n

Akoma Ntoso<\/h2>\n

Akoma Ntoso<\/a> (AkN) is a strong contender to be that format.\u00a0 Developed under the auspices of the UN Department of Economic and Social Affairs<\/a>, AkN is an XML data structure<\/a> that is meant to capture high-level forms and semantic ideas that are common to a broad variety of legal texts.\u00a0 OASIS<\/a>, the folks who brought us the DocBook standards, among others, have convened a standards committee to create an official legislative data standard based on AkN. (More disclosure: the engineer is a member of this committee.) There\u2019s just one problem. Few governments are using AkN to draft or store their legislation.<\/p>\n

AkN itself is fast evolving, and with more exposure to legal data structures from different jurisdictions, the OASIS committee will be able to adapt AkN to better model those structures.<\/p>\n

We saw the International Legislative Un-Hackathon<\/a> as a venue to kick off this process. It was conceived with Charles Belle of UC Hastings, as part of the Legal Hacks<\/a> initiative.\u00a0 The event was held simultaneously at UC Hastings, Stanford, in Denver.\u00a0 Jim Harper and Francis Avila of the Cato Institute came to the Hastings Event. We also had many international participants. Key among them were Professors\u00a0Monica Palmirani\u00a0and Fabio Vitali\u00a0of the University of Bologna, the architects and primary evangelists of AkN.\u00a0 Over the course of the day, participants learned about AkN and, importantly, got a chance to try it out, marking up documents of their choosing with the web editor.\u00a0 In the process, as expected, we found bugs in the software and bugs in the standard.\u00a0 We found structures in U.S. legislation that didn\u2019t fit well with the existing AkN element set. We saw places where there was confusion in applying AkN\u2019s data structures to documents.\u00a0 All of this information was collected to incorporate in the development of both the editor and AkN, underscoring again the importance of getting more practical exposure for both.<\/p>\n

University of Bologna Summer School–Ravenna<\/h2>\n

And we are working to expand the venues for this kind of practical exposure to develop the AkN standard. Every September, the University of Bologna hosts the\u00a0LEX Summer School<\/a>\u00a0in Ravenna, Italy. For them, it’s an opportunity to introduce Akoma Ntoso to new groups of students from around Europe and around the world. For the students, it’s an opportunity to learn about the application of XML to legislation, see the success various groups are having around the world, and to meet interesting new people having a passion for legal informatics. One of us, the engineer, who was a student two years ago, was invited to return last year to present a success story, and this year is returning once more to deliver a class in how to build and use the HTML5-based editor for drafting legislation in XML. For us, this is an opportunity to expose the editor to the European legal traditions in order for us to better understand how our editor must evolve to fulfill our vision of a unified standard around the world with common, highly adaptable, tools.<\/p>\n

Chile National Library of Congress Browser-based editor<\/h2>\n

Another step toward adoption of legislative data standards is a project by Chile’s National Library of Congress<\/a>\u00a0(BCN in Spanish) called the “History of the Law” (Historia de la Ley). \u00a0This ambitious project aims to bring together machine learning, a legislative editor and other features to mark up Chile’s legislative record and other legislative documents. \u00a0The BCN has chosen Xcential’s browser-based editor, working with the AkN standard, to conduct the mark-up and correction after documents are passed through an automated parser. As with the hackathon, but on a larger scale, we are learning from experience the modifications that are needed to AkN, to make it work with Chile’s live documents. \u00a0Excitingly, each mismatch we find between AkN and actual legislation can be fed back into the OASIS committee process, to make AkN able to handle a wider variety of real-world use cases.<\/p>\n

Other Efforts and the Future of Legislative Data Standards<\/h2>\n

We see these steps as just the beginning. European governments are also flirting with legislative standards, and\u00a0Karen Suhaka’s group at\u00a0BillTrack50<\/a>\u00a0has converted all U.S. bills from all states into a single standard XML format showing that the technical hurdles can be overcome, and many of the practical benefits of doing so. \u00a0In focusing on the projects (and hackathons) we are most closely involved with, we have certainly left out a lot of the initiatives that are advancing legislative data standards around the world. \u00a0That’s what the comments are for. Let us know your experience with Akoma Ntosa as a legislative standard, and what you’re doing or interested in doing with AkN or other legislative data standards worldwide.<\/p>\n

\"\"<\/a>Grant Vergottini (the engineer) is a founder of Xcential<\/a>. He is a leading authority on applications of XML data to legislation.\u00a0Prior to founding Xcential, Grant was the Director of Applications at Chrystal Software, a company dedicated to XML design and reporting software. Before Chrystal, Grant led the redesign of Homestore.com, and founded Genedax Design Automation, which developed innovative team and data management applications for electronics design. Bringing data structures and automation tools to the legislative drafting process parallels the work that Grant did earlier in his career at Mentor Graphics and the Boeing Company, where he participated \u00a0in the transformation from manual drafting to CAD software.\u00a0Mr. Vergottini holds a Bachelor of Science in Electrical Engineering from Cleveland State University, where he graduated Summa Cum Laude.<\/p>\n

\"\"<\/a>Ari Hershowitz (the lawyer) is a consultant at Xcential, and founder of Tabulaw<\/a>. Tabulaw develops software for lawyers, including a web-based legal research and writing platform. Prior to Tabulaw, Ari worked to protect wildlife and habitats from Chile to Mexico\u00a0as Director of the BioGems project for Latin America at the Natural Resources Defense Council<\/a>. Ari has a law degree from Georgetown University Law Center, a Masters in Computation and Neural Systems from Caltech, and a Bachelors in Molecular Biophysics & Biochemistry from Yale College.<\/p>\n

VoxPopuLII is edited by Judith Pratt<\/a>. Editors-in-Chief are Stephanie Davidson<\/a> and Christine Kirchberger<\/a>, to whom queries should be directed.<\/p>\n

[Editor’s Note<\/em>: For topic-related VoxPopuLII<\/em> posts please see: N\u00faria Casellas, Semantic Enhancement of legal information \u2026 Are we up for the challenge?<\/a>; Jo\u00e3o Lima, et.al, LexML Brazil Project<\/a>; and Rinke Hoekstra, The MetaLex Document Server<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"

As guest bloggers to this site, we have been asked to write about big ideas. \u00a0We\u2019ll get to those. But first, a note about hackathons. Hackathons used to be the exclusive domain of soda-and-coffee-guzzling, pizza-eating, all-night hacking, highly competitive computer programmers. The result of such a hackathon is often supposed to be a cool app […]<\/a><\/p>\n","protected":false},"author":96,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[466,299,238,598,608,329,464,4928,465,4],"tags":[4929],"_links":{"self":[{"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/posts\/2756"}],"collection":[{"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/users\/96"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/comments?post=2756"}],"version-history":[{"count":24,"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/posts\/2756\/revisions"}],"predecessor-version":[{"id":2802,"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/posts\/2756\/revisions\/2802"}],"wp:attachment":[{"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/media?parent=2756"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/categories?post=2756"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.law.cornell.edu\/voxpop\/wp-json\/wp\/v2\/tags?post=2756"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}