Talk:Reading/Web/PDF Functionality

About giving feedback

Update: (15 July 2019) We’ve launched the new PDF renderer. We’re looking at feedback, but haven't so far seen any significant issues. We might incorporate some suggestions, but want to note that this is not an ongoing project with continuous development. In other words, now that it's deployed and proven to work, the new renderer is entering maintenance mode. This page won’t be abandoned, but it could take a while before anyone reacts, simply because everyone's got so much else to do.  

In terms of books, we've left it in the hands of volunteer developers and PediaPress. We'll be glad to reach out to them with questions, but we're not planning any involvement in terms of the technical implementation.

Book maker formats in Persian language

Mobin2008 (talkcontribs)

Hi. I noticed that the book maker is not working well in Farsi.

For example, instead of this


it writes:

م ا ل س

With respect.

Reply to "Book maker formats in Persian language"
Omotecho (talkcontribs)
Reply to "Failed rendering"
Dirk Hünniger (talkcontribs)
Steelpillow (talkcontribs)

Looks good. I do not have time to test it myself, but it is crystal clear that the WMF need to wake up and smell the coffee.

Your main code may be written in Haskell, but how is that worse than a proprietary volunteer working on a (very) occasional basis and keeping their code closed source?

Dirk Hünniger (talkcontribs)
Reply to "Rendering Collections fast"
(talkcontribs)

Hello, will it be possible to export to EPUB? Thanks.

Johan (WMF) (talkcontribs)
Reply to "EPUB format for export"

Please install additional font on the pdf services for CJK characters.

Viztor (talkcontribs)

The default font looks terrible. Source Han Sans and Source Han Serif would be great options. All it takes is installing the font set.

Tgr (WMF) (talkcontribs)
Reply to "Please install additional font on the pdf services for CJK characters."

-Extended periodic table (detailed cells)- download PDF not working well

Andy181209 (talkcontribs)
Steelpillow (talkcontribs)

In a word, No. The table is ridiculously huge and cannot possibly be condensed into a meaningful page-based format.

Or, to put it another way, the table would have to be broken down at source into a multi-page friendly format first.

Dirk Hünniger (talkcontribs)

I tried with and the result looks really funny. If you have the Okular PDF viewer, you can zoom into the table at 1600% and, on a UHD screen, actually read some elements; still, it prints over the margin of the page. So yes, I tried my best, but a satisfactory result seems unlikely.

Reply to "-Extended periodic table (detailed cells)- download PDF not working well"

Thanks for this information

Summary by Johan (WMF)

Just a thanks.
(talkcontribs)

I adore you

Problem with PDF download

2A01:CB00:865C:AE00:B9:47D3:8F37:7153 (talkcontribs)

Very simple: just go to the "printable version" and ask for the text to be printed as a PDF. This works without any problem. At least on a Mac...

Reply to "Problem with PDF download"
Dirk Hünniger (talkcontribs)


I could generate PDF versions of all community-maintained books in the English Wikipedia and store them in a cloud accessible via SFTP. I could update each PDF once a year. We could link from the book template to the cloud with Lua. Are you interested?

Yours Dirk

Dirk Hünniger (talkcontribs)

So, I have now started a minimal setup: a five-year-old dual-core laptop next to my fridge, running one process at a time. The first three resulting PDFs are available here . I need about 1 hour per PDF, and each is about 20 MByte on average. Since I have roughly 6000 PDFs to make, this will take about 250 days and use about 120 GByte of disk space.
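
The estimate above can be checked with a quick back-of-the-envelope calculation. The sketch below only reuses the figures quoted in the post (about 1 hour and 20 MByte per PDF, roughly 6000 PDFs); the function name `estimate` and the `parallel_jobs` parameter are hypothetical and included only to show how the total would shrink if more than one process ran at a time.

```python
# Rough estimate based on the figures quoted in the post above.
HOURS_PER_PDF = 1    # "about 1 hour per pdf"
MB_PER_PDF = 20      # "20 MByte on average"
TOTAL_PDFS = 6000    # "roughly 6000 pdf to make"


def estimate(total_pdfs, hours_per_pdf, mb_per_pdf, parallel_jobs=1):
    """Return (days, gigabytes) for rendering the whole batch.

    parallel_jobs is a hypothetical knob: the setup described in the
    post runs one process at a time, so it defaults to 1.
    """
    days = total_pdfs * hours_per_pdf / 24 / parallel_jobs
    gigabytes = total_pdfs * mb_per_pdf / 1000  # decimal gigabytes
    return days, gigabytes


days, gigabytes = estimate(TOTAL_PDFS, HOURS_PER_PDF, MB_PER_PDF)
print(f"{days:.0f} days, {gigabytes:.0f} GB")
```

With one process at a time this reproduces the 250 days and 120 GByte given in the post; running two processes in parallel would halve the time but not the disk space.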

Sj (talkcontribs)

Sounds like a great candidate for a rapid grant for a dedicated faster computer.

Dirk Hünniger (talkcontribs)

I think the computer is not so much of a problem at the moment. I can afford the necessary hardware and I am quite happy if I don't have to do more paperwork. But what I actually think I need is a community decision on the English Wikipedia that says that I can run a bot to upload the pdfs on the English Wikipedia. I started the discussion here: . I think the 20 PDF files I generated up to now are enough to come to a decision on that.

Reply to "Book Creation -> Cached Books"
HNRSoftware (talkcontribs)

Looking down this thread, I see complaints about lack of funding. How much is needed? And how can I contribute some so that it goes to this project and not just into WMF coffers?

Johan (WMF) (talkcontribs)

I mean, "thank you" sounds so insufficient whenever I see posts like these.

But let's be honest here: it's not that the WMF lacks money but that it feels it needs to spend it on other things. That there are always things we'd like to do but we can't because our resources are limited.

But our resources are always going to be limited, and there's simply no way to donate money to areas that aren't prioritised, because there's an organisational cost to doing things, keeping track of things. The new single-page renderer was deployed today! So if that's what you're looking for, hopefully we've solved the main issues. If you're talking about books, my suggestion would be to get involved with the volunteer developers working on that and see how/if you can help them, but money to the WMF will go to general support for and development of the Wikimedia wikis.

Steelpillow (talkcontribs)

There are really two projects here.

The single-page pdf download has just been replaced by a new tool and is at least functional, though improvements would be nice.

The book rendering is being written on an unpaid volunteer basis by PediaPress. They are a commercial company, and it remains unresolved how open the codebase will be. Progress has been stalled for several months now. One way to move it forward might be to contact them directly and offer to pay for the work; however, watch the licensing. Another way might be to support the WMF, but they don't seem to care about the licensing, and as you can see, targeting funds will be hard. Or you could contact Dirk Hünniger, who has written an independent, open-source renderer of his own which appears to be functional, but he has no funds to set up a sufficiently powerful server. Dirk will probably read this and reply.

I hope this helps.

HNRSoftware (talkcontribs)

Kind-of what I had guessed. Every so often I contribute to WMF, just because it is a great concept and (eventually) the whole internet may work more cooperatively. I'm a retired software engineer and don't have serious web skills, so I doubt if I could contribute skill-wise to the pdf book project. One thing I do not see here is a design document for pdf functionality. Scanning down this thread, I can derive most of the decision points, but it is really not clear precisely what the vision is. Yes, wiki articles to pdfs and assembling pdfs into printable "books", but the varying styles of related articles will make it very difficult to integrate the articles into a smooth book without manual intervention at some point in the process.

My personal interest is consolidating articles on SciFi authors (those of interest to me, not all possible authors) and some other things like Raspberry Pi microcomputers. These would probably be better served by a bookmark organizer than a PDF book creator, although consolidating to PDF has a lot of attraction. One key advantage of PDF articles is that the links are "live", so I can actually use the PDF as a starting point for further reading.

Dirk Hünniger (talkcontribs)

Well, I cannot accept any funding either. I am a government employee and not supposed to have any other income. If you want to set up a server for yourself, it's as easy as installing Ubuntu and running

sudo apt-get install mediawiki2latex
mediawiki2latex -s 80

The reason I don't set up a publicly reachable server is not so much a lack of funding as the laws made by the government I am working for. There is quite a high risk that I would spend the rest of my life in prison if I did so. The law is called Vorratsdatenspeicherung.

Dirk Hünniger (talkcontribs)

If you really have too much money and want to spend it on PDF tool development, you could hire Henning Thielemann (PhD). He is an experienced Haskell freelancer and has worked on mediawiki2latex before. He recently told me that he is still interested in working on it, provided that funding is available.

Sj (talkcontribs)

Sounds to me as though Henning could apply for a rapid grant to get a dedicated server to run book-to-pdf batches every month. Then if that works, he could apply for a normal project grant for extensions/expansion.

Reply to "Funding needed?"