PDF Editor
Thread poster: swiss solutions
swiss solutions
swiss solutions  Identity Verified
Romania
Local time: 12:10
Member (2006)
English to Romanian
+ ...
Jan 16, 2009

Hello, everyone

What would be the PDF editing software that you would recommend purchasing? Lately, we have large amounts of PDF source documents that need to be translated. Though we have a PDF to Word soft, it rarely transforms PDF to Word documents at best quality. On the other hand, I must add that we do our best to get an editable format from the client but it almost never happen....
See more
Hello, everyone

What would be the PDF editing software that you would recommend purchasing? Lately, we have large amounts of PDF source documents that need to be translated. Though we have a PDF to Word soft, it rarely transforms PDF to Word documents at best quality. On the other hand, I must add that we do our best to get an editable format from the client but it almost never happen.

Thank you!

Have a nice day!
Collapse


 
Kevin Lossner
Kevin Lossner  Identity Verified
Portugal
Local time: 10:10
German to English
+ ...
Do it right Jan 16, 2009

If I understood your message correctly, you already have OCR software, though you didn't specify which. The conversion quality issues are very often dependent on how you use the software; most of the people I have seen use it, do so in a way that is not at all conducive to high-quality translation work, especially with CAT tools.

There is no "PDF editing software" to recommend for the situation you describe. Learn how to do OCR correctly, or if your OCR software is not very good, ge
... See more
If I understood your message correctly, you already have OCR software, though you didn't specify which. The conversion quality issues are very often dependent on how you use the software; most of the people I have seen use it, do so in a way that is not at all conducive to high-quality translation work, especially with CAT tools.

There is no "PDF editing software" to recommend for the situation you describe. Learn how to do OCR correctly, or if your OCR software is not very good, get better software and learn to use it. Take a look at the "How To" tab on my profile, and you'll find some old instructions for post-OCR workflow (using Abbyy FineReader as an example) that will help you improve the quality of your results. To this I would add getting hold of Dave Turner's "CodeZapper" macro collection to clean up the resultant RTF/Word files and eliminate superfluous codes/tags better. (These macros can be found in the download area of the dejavu-l list on Yahoogroups and elsewhere - they are just as useful for TagEditor users as for DVX users.)
Collapse


 
Miroslav Jeftic
Miroslav Jeftic  Identity Verified
Local time: 11:10
Member (2009)
English to Serbian
+ ...
Yup Jan 16, 2009

I agree that there's no "PDF software" to recommend here. As for the good OCR solution, it's either Abbyy FineReader or Nuance OmniPage, nothing else comes near I think.

 
Sergei Leshchinsky
Sergei Leshchinsky  Identity Verified
Ukraine
Local time: 12:10
Member (2008)
English to Russian
+ ...
OCR manually !!! Jan 16, 2009

Never delegate tasks to "automatic modes" if you are interested in the results. The process is:
1) FineReader + manual segmentation,
2) formatting of the source in Word before translation,
3) CAT,
4) formatting after translation,
5) export to PDF (use DoPDF: i) free and ii) better rendering of images, though iii) bigger files).


 
Paola Dentifrigi
Paola Dentifrigi  Identity Verified
Italy
Local time: 11:10
Member (2003)
English to Italian
+ ...
Abby Jan 16, 2009

It saved my life, as most of the translations I get are .pdf.

 
Marina Aleyeva
Marina Aleyeva  Identity Verified
Israel
Local time: 12:10
Member (2006)
English to Russian
+ ...
Solid Converter - more or less ok Jan 17, 2009

I assume from your post that you are looking for a better PDF to Word solution than what you have had. Solid Converter is ok for less sophisticated tasks. Unlike many other tools including paid ones, it can tell lines from paragraphs and headers/footers from the rest of the document. It will also recognize formatting and graphics. However, the quality of what you eventually get in Word is far from ideal, particularly where the layout is tricky. Quite a lot of manual formatting is still required,... See more
I assume from your post that you are looking for a better PDF to Word solution than what you have had. Solid Converter is ok for less sophisticated tasks. Unlike many other tools including paid ones, it can tell lines from paragraphs and headers/footers from the rest of the document. It will also recognize formatting and graphics. However, the quality of what you eventually get in Word is far from ideal, particularly where the layout is tricky. Quite a lot of manual formatting is still required, and complicated tables turn into a mess.Collapse


 
José Henrique Lamensdorf
José Henrique Lamensdorf  Identity Verified
Brazil
Local time: 06:10
English to Portuguese
+ ...
In memoriam
PDF Editor Jan 17, 2009

Kevin Lossner wrote:
There is no "PDF editing software" to recommend for the situation you describe. Learn how to do OCR correctly, or if your OCR software is not very good, get better software and learn to use it.


Actually there is PDF editing software, InFix, from http://www.iceni.com , but it only works with PDF files generated from applications, aka "distilled". For scanned PDFs, OCR is the only way out.


 
Kevin Lossner
Kevin Lossner  Identity Verified
Portugal
Local time: 10:10
German to English
+ ...
Forget it Jan 17, 2009

José Henrique Lamensdorf wrote:
Actually there is PDF editing software, InFix, from http://www.iceni.com , but it only works with PDF files generated from applications, aka "distilled". For scanned PDFs, OCR is the only way out.


Here are the product claims:



Faster PDF Translation

Use Infix to provide faster translations for your clients. Copy phrases and paragraphs direct from the PDF, translate them using your favourite system then paste them directly into the PDF. Take care of any formatting issues such as overrunning text there and then.

* Translate the text within the PDF directly
* Make text formatting changes as you translate the text
* Ensure original layout of brochures, catalogues is maintained 100%
* Avoid conversions to intermediate formats
* Open two copies of the same document side by side



Though interesting, the software is useless for most translation purposes. The only real use I can see for it is minor touch-up of a PDF headed for print and extremely simple translation jobs of simple text blocks without CAT tools.

In multi-column layouts the text blocks are not linked. In my tests copying and pasting between InFix and MS Word, formatting features like spacing between paragraphs were lost. The idea of trying to copy a large text back over the original paragraph-for-paragraph and adjust the text blocks for changes in the text length, perhaps move graphic elements (like signatures) to adjust to layout changes, etc. - no thanks. The claim of "faster PDF translation" is a sad joke. Faster than what? I could see using this tool to do a simple one-page flyer quickly, but for anything else, the additional trouble is simple not worth it.

There really is no substitute for a good OCR program. As I have pointed out before (in discussions, articles, etc.), one can make OCR services part of one's business and earn good money. The first year I started doing this I added several thousand euros to our income with hourly charges or surcharges to translation line rates and word rates for OCR preparation. That more than pays for the few hundred (or maybe less) for a good OCR program. Even if you don't charge for OCR services, the improvement in efficiency for dealing with PDF files of all kinds is worth the investment.


 
Brandis (X)
Brandis (X)
Local time: 11:10
English to German
+ ...
With José Jan 20, 2009

Hi!
This works only on file scans, you do not need a CAT for translation, it is a direct solution, maintains the format. Document or page scans require generally OCR, here I find Abby and IRIS are two competing products. Both are good though, Abby is most widely used in our profession as I observe from various other postings. BR Brandis


 
swiss solutions
swiss solutions  Identity Verified
Romania
Local time: 12:10
Member (2006)
English to Romanian
+ ...
TOPIC STARTER
Thank you all for your helpful suggestions Jan 23, 2009

Thank you all for your advice and suggestions. Mr. Lossner, I read your articles related to OCR workflow and found them very helpful.

Thank you all!


Brandis wrote:

Hi!
This works only on file scans, you do not need a CAT for translation, it is a direct solution, maintains the format. Document or page scans require generally OCR, here I find Abby and IRIS are two competing products. Both are good though, Abby is most widely used in our profession as I observe from various other postings. BR Brandis


 


To report site rules violations or get help, contact a site moderator:

Moderator(s) of this forum
Fernanda Rocha[Call to this topic]

You can also contact site staff by submitting a support request »

PDF Editor






Protemos translation business management system
Create your account in minutes, and start working! 3-month trial for agencies, and free for freelancers!

The system lets you keep client/vendor database, with contacts and rates, manage projects and assign jobs to vendors, issue invoices, track payments, store and manage project files, generate business reports on turnover profit per client/manager etc.

More info »
TM-Town
Manage your TMs and Terms ... and boost your translation business

Are you ready for something fresh in the industry? TM-Town is a unique new site for you -- the freelance translator -- to store, manage and share translation memories (TMs) and glossaries...and potentially meet new clients on the basis of your prior work.

More info »