Manual talk:Pywikibot/pagefromfile.py

From MediaWiki.org
Jump to navigation Jump to search
The following discussion has been transferred from Meta-Wiki.
Any user names refer to users of that site, who are not necessarily users of MediaWiki.org (even if they share the same username).

limitation?[edit]

I've used pagefromfile.py and when i try to import an article that is greater than a certain (250?) characters then it will fail. Does anyone know if there is a limitation to the length of the article? And so yes, what can you do about it? --BB70 13:20, 10 May 2006 (UTC)

I have not experienced anything like that error. Do you mean the page title is over 250 characters? --Connel MacKenzie 13:56, 20 July 2006 (UTC)
The article itself is over 250 characters including the title BB70 12:30, 28 July 2006 (UTC)
Don't work : it uploads only a part of each page.

title[edit]

I dont like to have automatically title in page so I inserted line

           contents = re.search("'''.*?'''([^\Z]*)",contents).group(1) 

before line

           page.put(contents, comment = commenttext, minorEdit = False)

--Liso 07:26, 25 June 2006 (UTC)

What didn't work?[edit]

Looking at a recent edit to this page, I see a complaint from someone that the proper technique didn't work. I know that on the English Wiktionary, the alternate technique now listed does not work correctly. Could someone please explain where/when/how that technique doesn't work, and what unexpected results were encountered? --Connel MacKenzie 14:00, 20 July 2006 (UTC)

python script which I downloded (I dont know if there is new version or not) does not found title of article because it search it betveen xxxx and yyyy ... --Liso 21:04, 26 July 2006 (UTC)

Creating Redirects[edit]

I tried creating redirects using Pagefromfile but it became an article.

My text file is like so...

xxxx
#REDIRECT [[Article]]
 '''Article'''
 [[Category:something]]
yyyy

the script created was so, except for the xxxx and yyyy. What I need is that the '''Article''' be not included. That the end result will become

#REDIRECT [[Article]]
 
 [[Category:something]]

What can be added to the script to achieve this? I do not know phyton programming. -- Hiong-eng

So do you need strip title from contents of inserted article? Please see Manual_talk:Pywikipediabot/pagefromfile.py#title! (you need to edit pagefromfile.py file):) --Liso 21:01, 26 July 2006 (UTC)
oops, sorry, I missed that :) thanks! -- Hiong-eng

This has been fixed now, there is a -notitle keyword. To make a redirect, make a file dict.txt that looks like this:

{{-start-}}
'''Article title'''#REDIRECT[[New title]]
{{-stop-}}

and type:

python pagefromfile.py -notitle

redirected page[edit]

Lets there are two pages A and B. B is redirected to A (B-->A)

I try to do the following

xxxx
'''B'''
bla-bla
yyyy

and as result I've got the error:

Traceback (most recent call last):
  File "/export1/wiki/pywikipedia/pagefromfile.py", line 324, in <module>
    main()
  File "/export1/wiki/pywikipedia/pagefromfile.py", line 320, in main
    bot.run()
  File "/export1/wiki/pywikipedia/pagefromfile.py", line 141, in run
    self.put(title, contents)
  File "/export1/wiki/pywikipedia/pagefromfile.py", line 170, in put
    contents = page.get() + contents
  File "/export1/wiki/pywikipedia/wikipedia.py", line 638, in get
    raise IsRedirectPage, self._redirarg

I would like to have such behavior: if B is redirected page then script have to use A as target page.

How I can do it? Can anybody help me? --Dnikitin 03:00, 22 December 2008 (UTC)

Error: invalid charactors[edit]

Sometimes, I got the following error

'utf8' codec can't decode byte 0xab in position 88757: unexpected code byte
ERROR: Invalid characters found on http://...wiki.ru/wiki/index.php5?useskin=monobook&title=Special:Allmessages&redirect=no&ot=xml, replaced by \ufffd.

What it mean and how I can fix it? --Dnikitin 04:25, 22 December 2008 (UTC)

Your input file is not encoded correctly. You must make sure that it is UTF-8 encoded. --130.126.229.82 17:13, 10 May 2011 (UTC)

Error: Unable to add unicode[edit]

I am trying to add unicode characters, the pagefromfile doesnt work for the same. My file contents are like this .. any help would be highly appreciated.

{{-start-}}

<<<perssimism>>>

==ಇಂಗ್ಲೀಷ್==

===ನಾಮಪದ===

'''{{subst:pessimism}}'''

# [[ನಂಬಿಕೆಯಳಿತ]]
{{-stop-}}

Editing Existing Pages Despite -safe?[edit]

I have a list of articles that I want to create (automatically creating Wiki pages from a non-Wiki source), but some of the pages have already been created manually. The -safe switch is supposed to have the bot ignore existing pages, but it doesn't seem to be doing so - it overwrites. Oddly, when I stopped the script after it did this to two pages, after trying it again it does skip those pages that it had created on previous tries - but not pages created by others. Any idea why this is? --DragoonWraith 00:59, 9 August 2010 (UTC)

I've had the same problem! The bot has replaced pages although I had selected -safe! That's bad. Kampy 21:56, 21 October 2011 (UTC)

Adding articles with template[edit]

I tried to upload several articles that have template before first line. Is there a command to bypass this? Regards, Klemen Kocjančič (Talk - Fast reply) 03:06, 27 December 2010 (UTC)

I don't understand: what's the problem? Perhaps you've bold text in the template and that's interpreted as the title? In that case, you just have to put the title first and use the -notitle parameter. --Nemo 10:03, 27 December 2010 (UTC)

Thanks for help. I had bold text in templates, and that was the problem. I'll watch out for future projects. Regards, --Klemen Kocjančič (Talk - Fast reply) 15:08, 27 December 2010 (UTC)

Extra parameter for title[edit]

Let's say that I want to add several disambig articles using this script. The articles can't have (disambiguation) in them (in article's bold text, so is it possible to set bot, that it will add (disambiguation) by itself after he picks up the title (bold text)? Regards, --Klemen Kocjančič (Talk - Fast reply) 05:13, 26 March 2011 (UTC)

Have you put some quotes around your title, or read it from a file, what's your command line exactly please? JackPotte 09:43, 26 March 2011 (UTC)

My regular command is "pagefromfile.py - file:test.txt - start:xxxx - end:yyyy -summary:nč" (but this will only use headlines, that are in bold text). Is it possible to add extra parameter in command line, so that bot would include also disambig in article's title, when posting articles? Regards, --Klemen Kocjančič (Talk - Fast reply) 18:33, 26 March 2011 (UTC)

Page created with the path of the API as the name[edit]

I am using the bot to create pages but the pages instead of being created with the name I specify are being created with the path of the API as the title.

For example: API Path = /wiki/api.php Input file:

xxxx
'''Album:8BP001'''
{{:Album:Colors|Renderer={{{Renderer|}}}}}
yyyy

Commands

pagefromfile.py -start:xxxx -end:yyyy -file:import.xml -notitle

Page Title = Wiki/api.php

69.168.52.106 I had the same problem. Then I upgraded to the latest version of MediaWiki (1.18) and the problem no longer happened.

85.211.55.11 19:49, 6 May 2011 (UTC)

Feature Request: Update only if Different[edit]

Hello! I'm using this script to automatically update several (>80) pages that I watch. I'm interested in an option to only perform a page update if the new markup is different from the existing markup. I realize this is probably computation-intensive, but I'd be interested in having it available as an option. --130.126.229.82 01:37, 17 May 2011 (UTC)

Encoding[edit]

I have problem with encoding. I create textual file and saved it using UTF-8 encoding, but I still get strange characters instead of Cyrilic charaters. Where is the problem? (btw, im using MS-DOS)-- Bojan  Talk  12:18, 15 June 2011 (UTC)

Title with two or more words[edit]

«The pagename will be the first headword in your textfile (the first word between ''' '''). It is currently automatically inserted in the uploaded page.» I would like add some articles about people, so between the ''' ''' and as pagename would be '''Firstname Lastname'''. I tried to write with underline, it worked half, as the page became '''Firstname–Lastname''' instead of '''Firstname_Lastname'''.

Config.py is "console_encoding = 'utf-8'" and the text editor I use to write the document.txt is "Unicode (UTF-8)", so it should work well together, but also vowels with accents are in trubble. Any ideas? --Eruedin (talk) 19:09, 17 July 2012 (UTC)

No changes were needed on pagename?[edit]

Why it says this?--維基小霸王 (talk) 15:12, 18 October 2016 (UTC)