User:Nealindia/Mock

Proof of Concept for my idea
I plan to create a Content Analyser tool that scans through the full text of an article.

Once we have the article text, we extract from it references to places such as cities, towns, and other notable locations.

Using a geocoding service such as Google's (http://code.google.com/apis/maps/documentation/services.html#Geocoding), we can get the coordinates of each referenced place and mark it on the map, or mark an area of interest (KML).
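As a rough sketch of the last step, the snippet below turns already-geocoded places into KML placemarks. The function name places_to_kml and the hard-coded, approximate coordinates are assumptions made for illustration only; in the real tool the coordinates would come from the geocoding service, which is not called here.

```python
import xml.etree.ElementTree as ET

KML_NS = "http://www.opengis.net/kml/2.2"


def places_to_kml(places):
    """Build a KML document from (name, latitude, longitude) tuples.

    Hypothetical helper for illustration; not part of any existing tool.
    """
    # Register the KML namespace as the default so tags are written unprefixed.
    ET.register_namespace("", KML_NS)
    kml = ET.Element(f"{{{KML_NS}}}kml")
    doc = ET.SubElement(kml, f"{{{KML_NS}}}Document")
    for name, lat, lon in places:
        placemark = ET.SubElement(doc, f"{{{KML_NS}}}Placemark")
        ET.SubElement(placemark, f"{{{KML_NS}}}name").text = name
        point = ET.SubElement(placemark, f"{{{KML_NS}}}Point")
        # KML expects longitude first, then latitude.
        ET.SubElement(point, f"{{{KML_NS}}}coordinates").text = f"{lon},{lat}"
    return ET.tostring(kml, encoding="unicode")


# Approximate coordinates, hard-coded here in place of a geocoder response.
geocoded = [("Honolulu, Hawaii", 21.3069, -157.8583)]
kml_text = places_to_kml(geocoded)
print(kml_text)
```

The resulting string can be saved as a .kml file and loaded by any map viewer that understands KML.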

Now, let's consider an example. I have copied the following paragraph from the Wikipedia page on Barack Obama: --

Barack Obama was born at Kapi'olani Maternity & Gynecological Hospital in Honolulu, Hawaii, United States,[4] to Stanley Ann Dunham,[5] an American of predominantly English descent from Wichita, Kansas,[6] and Barack Obama, Sr., a Luo from Nyang’oma Kogelo, Nyanza Province, Kenya Colony. Obama is the first President to have been born in Hawaii.[7][8] Obama's parents met in 1960 in a Russian language class at the University of Hawaii at Mānoa, where his father was a foreign student on scholarship.[9][10] The couple married on February 2, 1961,[11] and Barack was born later that year. His parents separated when he was two years old and they divorced in 1964.[10] Obama Sr. remarried and returned to Kenya, visiting Barack in Hawaii only once, in 1971. He died in an automobile accident in 1982.[12]

--

As we can see, the only information we want to show on the map is marked in bold, and other geographical information that we don't need is marked in bold and italics. Using NLP techniques, we can disregard irrelevant and repeated information, and use only places that are introduced by phrases like "born at Hawaii", "lives at NY", "works at White House", etc.
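A very simple stand-in for this filtering step could look like the sketch below. It is not real NLP: it only assumes that a relevant place is a run of capitalised words directly after a cue phrase, and it drops repeated mentions. The cue list and the name extract_cued_places are hypothetical and chosen for illustration.

```python
import re

# Cue phrases that signal a place we care about (illustrative, not exhaustive).
CUES = ("born at", "born in", "lives at", "lives in", "works at")

# A cue phrase followed by one or more capitalised words.
CUE_RE = re.compile(
    r"\b(" + "|".join(CUES) + r")\s+([A-Z][\w'’-]*(?:\s+[A-Z][\w'’-]*)*)"
)


def extract_cued_places(text):
    """Return (cue, place) pairs in order of appearance, skipping repeated places."""
    seen = set()
    results = []
    for cue, place in CUE_RE.findall(text):
        if place not in seen:
            seen.add(place)
            results.append((cue, place))
    return results


sample = (
    "He was born at Hawaii, lives at NY, and works at White House. "
    "He was born at Hawaii."
)
print(extract_cued_places(sample))
# [('born at', 'Hawaii'), ('lives at', 'NY'), ('works at', 'White House')]
```

A pattern like this stops at commas, so "Honolulu, Hawaii" would be cut to "Honolulu"; a proper named-entity recogniser would be needed for real article text.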

Once we have extracted this information, the geocoding service mentioned above gives us the actual coordinates of each place, so we can easily integrate maps into our articles.

This technique would become fundamental to the integration of OpenStreetMap (OSM) into Wikipedia and other MediaWiki websites, and would allow maps to be added without much manual work.

I also plan to show the locations of all contributors to an article on that page, to give a better picture of the geographical awareness of its topic.

Please feel free to edit this article and put in your suggestions.

Discussion
Could you explain the benefit of your extraction in comparison with the existing geocoords extraction that we have?
 * http://toolserver.org/~dispenser/cgi-bin/locateCoord.py?dbname=coord_dewiki&lon=13.741347&lat=51.058114&range_km=10
 * It is updated daily using the database entries for links to geohack in the externallinks table.


 * We also have a tool that shows all links in an article on a map:
 * http://maps.google.com/maps?f=q&source=s_q&hl=de&geocode=&q=http:%2F%2Ftoolserver.org%2F~para%2Fcgi-bin%2Fkmlexport%3Fproject%3Den%26article%3DDresden_Frauenkirche%26linksfrom%3D1&sll=51.051944,13.741666&sspn=0.000101,0.000279&ie=UTF8&t=h&z=3
 * What will be the benefit of your way?


 * If you use Google's geocoding methods extensively, we will need to talk about the license of the results. Are these results really free? In the European Union we may have some copyright problems there.


 * Why don't you want to use the toolserver, where you would have all Wikipedia databases accessible?

I hope these questions help to clarify your proposal. --Kolossos 13:02, 4 April 2010 (UTC)