Accessible Document Converter Solution

VerseOne's Accessible Document Converter (ADC) is designed to make it easier to convert PDF and Word documents into accessible HTML web pages. ADC is not designed to reproduce heavily designed documents faithfully, as though they were being viewed in a graphic design application or even a PDF viewer; this is why the module includes a link to download the original document.

Although HTML is continually being improved, it is fundamentally more limited than even most word processors. For example, HTML lists have no concept of "tiered numbering": we can use CSS to make it look as though they do, but this will not help those using screen readers, for instance.

How the converter works

Our conversions are provided by two services: PDF to Word conversion uses Adobe’s professional PDF Services API, and Word to HTML conversion is handled by a library called Pandoc. A PDF therefore goes through Adobe first and then through Pandoc, whereas a Word document goes through Pandoc only.
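The routing described above can be sketched in a few lines of Python. This is only an illustration of the order of the stages, not our actual implementation: the function name, the stage labels and the assumption that Word documents arrive as .docx files are all hypothetical, and no real Adobe or Pandoc calls are made.

```python
from pathlib import Path

# Hypothetical labels for the two conversion stages (illustration only).
PDF_TO_WORD = "adobe_pdf_services"
WORD_TO_HTML = "pandoc"


def conversion_stages(filename: str) -> list[str]:
    """Return the stages a document passes through, in order."""
    suffix = Path(filename).suffix.lower()
    if suffix == ".pdf":
        # PDFs are converted to Word first, then to HTML.
        return [PDF_TO_WORD, WORD_TO_HTML]
    if suffix == ".docx":  # assumption: Word documents are .docx files
        # Word documents skip the Adobe stage entirely.
        return [WORD_TO_HTML]
    raise ValueError(f"Unsupported document type: {suffix or filename}")
```

One consequence of this ordering is that a PDF passes through two translations rather than one, so it has more opportunities to pick up conversion errors than a Word document does.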

Once we receive the HTML, we apply a number of transforms ourselves, both to make up for some translation errors where possible and to ensure, for example, that we don’t have multiple identical images. This allows us to finesse some elements of the HTML, provided we have some way of determining the original data.
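As an illustration of the kind of transform described above, the following Python sketch removes duplicate images by comparing a hash of their contents. It is a minimal example under stated assumptions rather than the code we run: the function names are hypothetical, the images are assumed to be available as a mapping of file name to bytes, and the HTML is assumed to reference them with plain src="name" attributes.

```python
import hashlib


def dedupe_images(images: dict[str, bytes]) -> dict[str, str]:
    """Map each image name to the first name seen with identical bytes."""
    first_seen: dict[str, str] = {}  # content hash -> first image name
    canonical: dict[str, str] = {}   # image name -> name to keep
    for name, data in images.items():
        digest = hashlib.sha256(data).hexdigest()
        canonical[name] = first_seen.setdefault(digest, name)
    return canonical


def rewrite_sources(html: str, canonical: dict[str, str]) -> str:
    """Point duplicate image references at the copy that is kept."""
    for name, keep in canonical.items():
        if name != keep:
            # Simplification: assumes attributes are written as src="name".
            html = html.replace(f'src="{name}"', f'src="{keep}"')
    return html
```

Hashing the image contents rather than comparing file names means that two copies of the same logo saved under different names are still treated as one image.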

Although we are constantly trying to improve the module, some elements are beyond our control. The sections below outline known issues with the ADC, short-term workarounds, and any development progress that we have made or are researching.


Vector Images

Browsers render vector images using a format called Scalable Vector Graphics (SVG), which, whilst similar in purpose to the Encapsulated PostScript (EPS) format used by Illustrator and InDesign, is not exactly the same. If you copy and paste a vector image from one of those applications into a document, it is not clear to us what format the image is saved in.

We have, however, identified a library setting that may be able to preserve vector images as SVG, and we will be testing it in our current sprint. We will let you know whether this has worked, and any updates will also be posted to this page.