User:Kaldari/Task 2

Create a web page that takes the name of a Wikipedia category as input (either through a query string parameter or a form input) and outputs a sorted list of articles with scores. The scores should reflect how readable the first paragraph of each article is. The list should be sorted from least readable to most readable. Readability can be assessed by any method of your choosing. For example, you could consider factors such as length, number of parenthetical phrases, number of commas, average word size, etc. or you could use a 3rd party readability test API or library. Don't worry about handling subcategories or pagination for large categories (feel free to truncate the output list at 50 articles). Don't worry about supporting non-English Wikipedias.

The code should be written in PHP, Python, or Javascript and hosted on Github, bitbucket, or a similar site.

The web page can be hosted on Tool Labs or a personal server.

Hints: API:Categorymembers, Extension:TextExtracts

This task should take about 3 hours to complete.