Topic on Project:Support desk

How to separate sitemap into series small file?

4
Summary by Seb35

Tracked in phabricator:T65098.

Deletedaccount4567435 (talkcontribs)

I am using Manual:GenerateSitemap.php to generate sitemap now.

The world largest Chinese search engine Baidu notified me that they can't handle any sitemap larger than 50000 records. How can I separate sitemap every 50000 records? By default Generatesitemap.php separate siemap by namespace. However, even single "sitemap-zhmoegirlwiki-NS_0-0.xml.gz" contain more than 50000 links.

MarkAHershberger (talkcontribs)

I don't think you can split the sitemap up like that. If that is what Baidu wants, then you should file a bug to get splitting support.

Deletedaccount4567435 (talkcontribs)

Then is it possible to disable language variants link in sitemap? For example the sitemap will contain language variants in URL which actually all point to same page. I only want to keep "http://zh.moegirl.org/Help:DynamicPageList".

<url> <loc>http://zh.moegirl.org/Help:DynamicPageList</loc> <lastmod>2013-03-25T01:55:56Z</lastmod> <priority>0.5</priority> </url> <url> <loc>http://zh.moegirl.org/zh-cn/Help:DynamicPageList</loc> <lastmod>2013-03-25T01:55:56Z</lastmod> <priority>0.5</priority> </url> <url> <loc>http://zh.moegirl.org/zh-tw/Help:DynamicPageList</loc> <lastmod>2013-03-25T01:55:56Z</lastmod> <priority>0.5</priority> </url>

Ciencia Al Poder (talkcontribs)

That's probably a bug. You should report that as well

Reply to "How to separate sitemap into series small file?"