Extension talk:ManualSitemap

Sitemap Output Location
I have clean URL's enabled. So I have links like "http://MySite.com/wiki/page".

When I generate a sitemap I am always given the link "http://MySite.com/w/sitemap.xml". I can open that file and everything looks fine. However if I submit this link to Google Webmaster Tools, Google can't see the sitemap and it will show "Submitted URLs 0". I have to move the generated file into "http://MySite.com/sitemap.xml" then I can resubmit it to google and Google can then read it. How can I set your extension so that whenever I generate a sitemap, the file goes to "http://MySite.com/sitemap.xml"?

I noticed on your generated sitemap example page "http://jehy.ru/wiki/sitemap.xml" that your sitemap isn't in the "w" folder, how did you do that? Holygamer 17:40, 5 October 2010 (UTC)


 * Better late then never. In my case, install path for wiki is not /w but /wiki - so it's pretty same. I think that your problem is in your "http://MySite.com/robots.txt" file - it should point to your sitemap but I think that it does not. Jehy (talk) 06:01, 10 April 2013 (UTC)

Special:ManualSitemap Options
Could you please explain what these options mean:

Sorting criterion:


 * Popularity
 * Last Revision

Estimate revision frequences

Priority:
 * Constant
 * Linear
 * Quadratic
 * Cubic
 * Exponential
 * Smooth
 * Random
 * Reverse

What options would you recommend (particularly for Google?)

What exactly is the "moreover" entry for that we put in LocalSettings.php? Moreover doesn't appear to be a search engine so what's the point of submitting a site map to it?

Thank you
Thank you for fixing the GoogleSitemap extension for the current release of MediaWiki (13.2). I have three suggestions for enhancements that would be useful to have:
 * persistent preferences for the options (Sorting, Popularity, Priority), perhaps through LocalSettings
 * a link for generating a sitemap with more than 500 pages, e.g. a Max link, again this could be set in the LocalSettings at a default value of say 500, and then leave it to the user to decide how high to set it. This would avoid the need to edit the value of &limit in the request URL by hand.
 * ping notification result message, and support for submission to other search engines besides Google

Also, a question... I noticed that priority is now only recorded to one decimal place... this has puzzled me for ages, is a higher precision (say 0.333 rather than 0.3) used by any of the search engines.

Quotes 22:05, 23 November 2008 (UTC)

Thanks
Yeah, I would also like to make these, when I have some time... Thank you for suggesting. By they way, you CAN submit notification to other search engines besides Google - especially for it I made $ManualSitemap_Notify an array. You can just set it to smth like

If you write here the URLS for notifying other search engines, I would greatly appreciate it and add them to the extension specification.

Setting options in LocalSettings.php is a great idea, thanks! You don't really want to change sitemap creation options every day :))

Possibly I will bother with it on weekends.

About the precision - can't say something definite. It isn't mentioned in sitemap standard or covered in google topics. Only one thing is for sure - it would do you no harm ;)

Jehy 23:44, 24 November 2008 (UTC)

Ping URLs
Quotes 10:58, 7 December 2008 (UTC)
 * http://www.google.com/webmasters/sitemaps/ping?sitemap=http://www.example.com/sitemap.xml
 * http://search.yahooapis.com/SiteExplorerService/V1/ping?sitemap=http://www.example.com/sitemap.xml
 * http://webmaster.live.com/ping.aspx?sitemap=http://www.example.com/sitemap.xml
 * http://api.moreover.com/ping?u=http://www.example.com/sitemap.xml
 * http://submissions.ask.com/ping?sitemap=http://www.example.com/sitemap.xml

Wow
That's a big list you've got there. I'll probably to make an ajax script for pinging instead of php - or timeout will occure...--Jehy 00:50, 9 December 2008 (UTC)

Special Characters
Special Characters in the URL lead to non valid URLs in the Sitemap.xml (i.e. ä,ö,ü) --FlyerWiki 15:13, 27 March 2009 (UTC)
 * Are troublesome special characters in namespace, or page title?Jehy 23:42, 29 November 2009 (UTC)

Download Site down
I really want to use this extension on my wikis (hopefully it will work on the non-english ones too), but the download site is down. It looks like it was up two weeks ago, according to comments. Can someone upload a copy to a new source in case the original server doesn't come back?--Webbstre 13:11, 15 May 2009 (UTC)
 * Sorry, my server was resting. It's okay now.Jehy 06:28, 8 December 2009 (UTC)
 * Down again. Any chance for a mirror?

Limit of pages
If I go to /w/index.php?title=Spezial:ManualSitemap&limit=5500&offset=0 it gives me a list of 5000 pages, though I have actually more pages in my Wiki. Is there a special reason why it seems to be limited to 5000 pages? Thanks + cheers, --Till Kraemer 10:34, 5 September 2009 (UTC)


 * I have the same question. How can we sitemap more than 5000? And by the way, thanks for this extension. It's incredibly useful. --Mark 17:21, 18 November 2009 (UTC)
 * Fixed.Jehy 23:38, 29 November 2009 (UTC)
 * I'm gonna check it out. Thanks! --Mark 20:19, 2 December 2009 (UTC)
 * Nice...it still includes discussion pages, but I guess that was your intention. I just sent the whole thing to Google. --Mark 04:36, 7 December 2009 (UTC)
 * Ok, now you can remove discussion pages from sitemap, please see new options. Jehy 06:28, 8 December 2009 (UTC)
 * You're the man...Seriously, check my contributions...of all the questions I've posed here in the last year you're the only one who replied AND did something. Kudos, bro, wish you the best of the best. Now let me test the new version :) This extension is really, really useful. --Mark 16:25, 12 December 2009 (UTC)
 * That's because it was me who decided to rewrite and support this extension... Still, I forgot about it for this year :) I'm glad there are people who need this, please continue with feedback. Jehy 09:00, 13 December 2009 (UTC)
 * It's been a few months now and this extension still works great. Still on 1.15, but not planning to upgrade until a definite mediawiki release. Just one question I'm curious about: when you say google complains about constant priority, which priority do you recommend for google?--Mark 16:28, 29 April 2010 (UTC)

Can not exclude URLs with : (colon)
Why can I not exclude URLs with colon? I tried different ways to escape that colon but none worked! Did somebody succeded? Thanks Nick

Priority to specific pages
Is there a way to set the static priority to a specific page? Some var in the LocalSettings.php? Grosso 09:09, 3 May 2010 (UTC)

MW Version
It works also fine in MW 1.12.4 --188.195.254.42 10:16, 26 June 2010 (UTC)

Fixed for MW 1.19.2
Hello, I am the author of extension. Sorry for not updating for years :)

Now extension is working fine with 1.19.2 (thank you for bugfixes), and is published on github. Download link is on extension page, enjoy! Jehy (talk) 20:14, 17 September 2012 (UTC)

Stylesheet error
I get the following message when I try to look at the sitemap with a browser:

Error loading stylesheet: A network error occurred loading an XSLT stylesheet:

http://mysite.com/sitemap.xsl

I've looked at everything I can think of. Any ideas? DaveL (talk) 01:46, 6 June 2013 (UTC)

Mediawiki 1.18

Please advise
A database query error has occurred. This may indicate a bug in the software.

Query: SELECT "Popularpages" AS type,page_id AS id,page_namespace AS namespace, page_title AS title, ( MAX( rev_timestamp ) ) AS last_modification,page_counter AS value FROM `mw_page`, `mw_revision` WHERE ( page_namespace <> 8 AND page_namespace <> 9 AND page_namespace <>"1" AND page_namespace <>"2" AND page_namespace <>"3" AND page_namespace <>"5" AND page_namespace <>"7" AND page_namespace <>"8" AND page_namespace <>"9" AND page_namespace <>"10" AND page_namespace <>"11" AND page_namespace <>"12" AND page_namespace <>"13" AND page_namespace <>"14" AND page_namespace <>"15") AND page_is_redirect = 0 AND rev_page = page_id GROUP BY page_id Function: DatabaseBase::query Error: 1054 Unknown column 'page_counter' in 'field list' (localhost)