User:Slaporte/GSoC2010

Abstract
Creating a tool to format judicial decisions, legal scholarship, and statutes for Wikisource.

Identity
 * Name: Stephen LaPorte
 * Email: stephen.laporte -at- gmail.com
 * Project title: Wikisource Legal Tool

Contact/working info
 * Timezone: UTC-7 (Pacific Daylight Time)
 * Typical working hours: Flexible

Project summary
Wikisource should be a repository of statutory law, judicial decisions, and legal scholarship. Prof. Timothy K. Armstrong identified Wikisource as a solution to the architectural limitations of existing repositories for judicial decisions and legal scholarship. Prof. Armstrong listed three obstacles for Wikisource: legal, content, and cultural issues. The legal and cultural issues can be addressed through education and outreach. This project addresses the problem of content.

A tool to format judicial decisions and statutes will help users move text that is already electronically available and in the public domain to Wikisource, solving the "chicken-and-egg" problem that Wikisource currently faces. Once Wikisource has a substantial body of legal sources, users will gain value from and improve the coverage of those legal sources.

About me
I am currently a Juris Doctor candidate studying Intellectual Property Law at the University of California, Hastings College of the Law. In 2008, I received a Bachelor of Arts in English and Latin from the University of Nebraska, Lincoln. I am interested in open access to information, especially the law.

I have written small programs in PHP, and I regularly use MediaWiki templates when editing, so I am familiar with the MediaWiki template system.

Deliverables
The tool should be able to sort through text from standard sources, identify important information, and apply templates and wiki formatting. The tool should be part of a workflow that allows users to efficiently move content onto Wikisource.
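As a rough illustration of the "identify, sort, apply templates" pipeline described above, here is a minimal Python sketch. It is not the proof-of-concept code. The regular expressions only cover U.S. Reports citations and simple "X v. Y" captions, and the template name `CaseHeader`, its parameters, and the category names are hypothetical placeholders; the real ones would be whatever Wikisource settles on.

```python
import re

# U.S. Reports citation, e.g. "410 U.S. 113 (1973)".
US_CITATION = re.compile(r"\b(\d{1,3}) U\.S\. (\d{1,4}) \((\d{4})\)")
# A caption on its own line, e.g. "Roe v. Wade".
CAPTION = re.compile(r"^([A-Z][\w.,' ]*? v\. [A-Z][\w.,' ]*)$", re.MULTILINE)
# A statute section heading at the very start, e.g. "§ 101. Definitions".
STATUTE_START = re.compile(r"\s*(§|Sec\.|Section)\s*\d+")

# Hypothetical category names, used only for illustration.
CATEGORIES = {
    "statute": "Statutes",
    "concurrence": "Concurring opinions",
    "dissent": "Dissenting opinions",
    "majority": "Majority opinions",
    "unknown": "Unsorted legal texts",
}


def extract_metadata(text):
    """Pull the caption, citation, and year out of raw text (None if absent)."""
    caption = CAPTION.search(text)
    cite = US_CITATION.search(text)
    return {
        "title": caption.group(1).strip() if caption else None,
        "citation": f"{cite.group(1)} U.S. {cite.group(2)}" if cite else None,
        "year": cite.group(3) if cite else None,
    }


def classify(text):
    """Crude keyword heuristic that looks only at the start of the document."""
    if STATUTE_START.match(text):
        return "statute"
    head = text[:500].lower()
    if "concurring" in head:
        return "concurrence"
    if "dissenting" in head:
        return "dissent"
    if "opinion of the court" in head:
        return "majority"
    return "unknown"


def to_wikitext(text):
    """Wrap the text in a (hypothetical) header template and a category."""
    meta = extract_metadata(text)
    kind = classify(text)
    header = "\n".join([
        "{{CaseHeader",  # hypothetical template name
        f" | title    = {meta['title'] or ''}",
        f" | citation = {meta['citation'] or ''}",
        f" | year     = {meta['year'] or ''}",
        "}}",
    ])
    return f"{header}\n{text.strip()}\n[[Category:{CATEGORIES[kind]}]]"


sample = (
    "Roe v. Wade\n"
    "410 U.S. 113 (1973)\n"
    "MR. JUSTICE BLACKMUN delivered the opinion of the Court.\n"
)
print(to_wikitext(sample))
```

Keeping extraction, classification, and rendering as separate functions would let each one be improved for new sources without touching the others.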

Required deliverables

 * Identify key information, such as the title, citation, and author, from text or HTML
 * Classify the text, e.g. determine whether it is a statute or a U.S. Supreme Court concurring opinion
 * Apply wiki templates and categories based on the above information
 * Turn citations (full and short form) to cases, statutes, and secondary material into wiki links to the corresponding pages on Wikisource
 * Format footnotes with <ref> tags
 * Develop a workflow for users moving law and legal sources onto Wikisource
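The citation-linking and footnote deliverables above could look roughly like the following Python sketch. Several things here are assumptions rather than settled design: the `KNOWN_CASES` lookup table and its page titles are hypothetical, only U.S. Reports citations are recognized, a short-form citation is resolved to the known case in the same volume with the nearest earlier starting page, and footnotes are assumed to arrive as `[n]` markers in the body with matching `[n] text` lines at the end. The `<ref>` tag is the standard MediaWiki footnote tag.

```python
import re

# Hypothetical lookup: (volume, first page) -> Wikisource page title.
KNOWN_CASES = {
    (410, 113): "Roe v. Wade",
    (347, 483): "Brown v. Board of Education",
}

FULL = re.compile(r"\b(\d+) U\.S\. (\d+)\b")      # e.g. "410 U.S. 113"
SHORT = re.compile(r"\b(\d+) U\.S\. at (\d+)\b")  # e.g. "410 U.S. at 153"
NOTE_LINE = re.compile(r"^\[(\d+)\] (.+)$", re.MULTILINE)
MARKER = re.compile(r"\[(\d+)\]")


def _link_full(match):
    title = KNOWN_CASES.get((int(match.group(1)), int(match.group(2))))
    return f"[[{title}|{match.group(0)}]]" if title else match.group(0)


def _link_short(match):
    volume, pinpoint = int(match.group(1)), int(match.group(2))
    # Candidate cases: same volume, starting at or before the pinpoint page.
    starts = [p for (v, p) in KNOWN_CASES if v == volume and p <= pinpoint]
    if not starts:
        return match.group(0)
    title = KNOWN_CASES[(volume, max(starts))]
    return f"[[{title}|{match.group(0)}]]"


def link_citations(text):
    """Wrap recognized full and short-form citations in wiki links."""
    text = FULL.sub(_link_full, text)
    return SHORT.sub(_link_short, text)


def inline_footnotes(text):
    """Move '[n] note' lines into <ref> tags at the matching '[n]' markers."""
    notes = {num: body.strip() for num, body in NOTE_LINE.findall(text)}
    body = NOTE_LINE.sub("", text).rstrip()

    def replace_marker(match):
        note = notes.get(match.group(1))
        return f"<ref>{note}</ref>" if note else match.group(0)

    return MARKER.sub(replace_marker, body)


raw = (
    "The Court relied on Roe v. Wade, 410 U.S. 113 (1973).[1]\n"
    "\n"
    "[1] See 410 U.S. at 153.\n"
)
print(link_citations(inline_footnotes(raw)))
```

Footnotes are inlined before citations are linked, so citations inside a footnote are linked in the same pass. Citations that are not in the lookup table are left as plain text rather than turned into guesses.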

If time permits

 * Turn "Id." citations into links to the corresponding page on Wikisource
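One possible approach to "Id." citations, sketched in Python under simplifying assumptions: "Id." is taken to refer to the most recent full citation seen earlier in the text, only U.S. Reports citations are tracked, and the `titles` mapping from citation to Wikisource page title is a hypothetical input. An "Id." that follows a citation with no known page is left unlinked rather than pointed at an older authority.

```python
import re

# Either a full citation ("410 U.S. 113") or an "Id." with optional pinpoint.
CITE_OR_ID = re.compile(r"\b(\d+ U\.S\. \d+)\b|\bId\.(?: at (\d+))?")


def link_id_citations(text, titles):
    """Link each 'Id.' to the page of the immediately preceding citation.

    `titles` maps a citation string to a Wikisource page title
    (a hypothetical lookup supplied by the caller).
    """
    last_title = None

    def replace(match):
        nonlocal last_title
        if match.group(1):
            # A full citation: remember its page (None if unknown).
            last_title = titles.get(match.group(1))
            return match.group(0)
        if last_title is None:
            return match.group(0)
        return f"[[{last_title}|{match.group(0)}]]"

    return CITE_OR_ID.sub(replace, text)


titles = {"410 U.S. 113": "Roe v. Wade"}
text = "See 410 U.S. 113. Id. at 153. But see 5 U.S. 137. Id."
print(link_id_citations(text, titles))
```

Because the function only reads full citations and rewrites "Id." matches, it can run before or after the full citations themselves have been linked.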

Project schedule

 * 1) Identify electronic sources of judicial decisions and statutes, and determine copyright status
 * 2) Discuss with mentor and other people the best way to build the tool
 * 3) Review proof of concept with mentor
 * 4) Expand/rewrite script for identifying key information
 * 5) Expand script for applying templates and categories
 * 6) Automate templates within tool
 * 7) Write script to identify citations
 * 8) Write script to identify short-form citations
 * 9) Improve range of identifiable sources
 * 10) Develop workflow and integrate this tool

Key dates

 * May 24: Evaluate proof of concept
 * May 31: Work on function to identify key information
 * June 14: Work on function to identify citations
 * July 16: Submit tool for midterm evaluation
 * July 26: Test across sources
 * August 6: Finish tool, write documentation for workflow

Participation
I will communicate frequently with my mentor, and I will contact others (such as research librarians) who are familiar with electronically available legal sources. I have a clear vision of what I want the tool to do, and I want to discuss with a mentor the best technology and approach for doing it.

I see this tool as a first step toward a WikiProject:Law on Wikisource, so I am interested in building something effective.

Proof of concept
Proof of concept code is running here, with an explanation here.