Pages in topic:   [1 2] >
CAT tool to translate scanned documents?
Thread poster: cristina villanueva
cristina villanueva  Identity Verified
Spain
English to Spanish
Dec 3, 2012

Does anyone know a CAT tool which can be used to translate scanned documents? Trados does not allow it.

Thanks a lot
Cristina


 
Tony M
France
Local time: 06:16
Member
French to English
SITE LOCALIZER
AFAIK there isn't one Dec 3, 2012

...at least not directly!

If you are referring to PDF documents, then you may wish to search for the various previous discussions on the subject of converting PDF > DOC, which is usually the process you need to go through before then translating using your CAT.

The same thing holds good for documents in image formats, and usually some kind of OCR processing will be necessary in order to recover editable text.

I believe I read somewhere the other day that one of the on-line CAT tools does actually offer this facility, but I have no idea how (or if!) it works.
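
As a rough illustration of the workflow described above (recover editable text first, then translate in the CAT tool), here is a minimal Python sketch. The function name `preparation_step` and the two extension lists are assumptions made for this example; they are not taken from Trados or any other tool, and a real PDF still has to be inspected to see whether it is image-based.

```python
from pathlib import Path

# Assumed groupings, for illustration only; adjust to what your own tools accept.
IMAGE_FORMATS = {".jpg", ".jpeg", ".png", ".tif", ".tiff", ".bmp", ".gif"}
EDITABLE_FORMATS = {".doc", ".docx", ".rtf", ".odt", ".txt"}


def preparation_step(filename):
    """Suggest what to do with a source file before opening it in a CAT tool."""
    suffix = Path(filename).suffix.lower()
    if suffix in IMAGE_FORMATS:
        return "ocr"        # pure image: the text must be recognised first
    if suffix == ".pdf":
        return "convert"    # PDF > DOC conversion; OCR as well if the PDF is image-based
    if suffix in EDITABLE_FORMATS:
        return "translate"  # already editable text
    return "unknown"        # anything else needs a human decision


for name in ["contract_scan.jpg", "brochure.PDF", "manual.docx"]:
    print(name, "->", preparation_step(name))
```

The sketch only looks at the file extension, so it cannot tell a text-based PDF from a scanned one; that check is still manual.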


 
Heinrich Pesch  Identity Verified
Finland
Local time: 07:16
Member (2003)
Finnish to German
Impossible without scanning Dec 3, 2012

It is not a CAT tool's job to scan images; there is dedicated software for that step.

 
Siegfried Armbruster  Identity Verified
Germany
Local time: 06:16
English to German
In memoriam
Get some training on PDF conversion Dec 3, 2012

e.g.

webinar: Converting PDF files into a workable format for translators

For info and registration see http://gxplanguageservices.wordpress.com/webinar


 
Egils Grikis  Identity Verified
United Kingdom
Local time: 05:16
Russian to Latvian
jpeg to word or pages Dec 3, 2012

Take a look at this discussion: jpeg to word or pages

http://eng.proz.com/forum/apple_mac_operating_systems/238507-jpeg_to_word_or_pages.html

cristina villanueva wrote:

Does anyone know a CAT tool which can be used to translate scanned documents? Trados does not allow it.

Thanks a lot
Cristina


 
Gerard de Noord  Identity Verified
France
Local time: 06:16
Member (2003)
English to Dutch
Wordfast Anywhere Dec 3, 2012

You can give Wordfast Anywhere a try. It's free.
http://www.freetm.com/

Cheers,
Gerard


 
Tony M
France
Local time: 06:16
Member
French to English
SITE LOCALIZER
But there are snags... Dec 3, 2012

Gerard de Noord wrote:

You can give Wordfast Anywhere a try.


I just did, and I have to say I was so not impressed.

It couldn't produce anything at all from an image-based PDF file (which I think is what Cristina was asking about), and even from a nice, clean DOC > PDF converted file, its OCR results were, to say the least, unusable!

[Edited at 2012-12-03 21:31 GMT]


 
esperantisto  Identity Verified
Local time: 07:16
Member (2006)
English to Russian
SITE LOCALIZER
Different experience Dec 4, 2012

Tony M wrote:
…and even from a nice, clean DOC > PDF converted file, its OCR results were, to say the least, unusable!


My experience is the opposite: I fed a PDF file to WFA once or twice, and the results were surprisingly good. Those were, however, files with a quite simple layout. The lack of control over the recognition process itself is a minus, though: I prefer to mark up manually the areas of each page that should be recognized.


 
Dominique Pivard  Identity Verified
Local time: 07:16
Finnish to French
setting the language in Wordfast Anywhere Dec 4, 2012

Tony M wrote:
Gerard de Noord wrote:
You can give Wordfast Anywhere a try.

I just did, and I have to say I was so not impressed.
It couldn't produce anything at all from an image-based PDF file (which I think is what Cristina was asking about), and even from a nice, clean DOC > PDF converted file, its OCR results were, to say the least, unusable!

Did you create a dummy TM in which the source language matched the language of the PDF you wanted to OCR? This allows the OCR engine to use a dictionary for that language, when it encounters words that cannot be read easily.
Can you post the image-based PDF for which you obtained unusable results?

You may want to have a look at this:

http://wordfast.fi/blog/cat-tools/2011/10/12/converting-a-dead-pdf-to-word-with-wordfast-anywhere/
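
The point about telling the OCR engine which language to expect applies to OCR in general. How Wordfast Anywhere passes the TM source language to its engine is not documented in this thread, so the sketch below uses the open-source Tesseract command-line tool as a stand-in. The helper name `build_ocr_command` and the small language table are assumptions made for this example; the `-l` option and the three-letter language codes are Tesseract's own. Note that Tesseract reads page images, not PDF files, so the input is assumed to be a page already exported as an image.

```python
# Assumed mapping from language names to Tesseract language codes.
TESSERACT_CODES = {
    "english": "eng",
    "french": "fra",
    "german": "deu",
    "spanish": "spa",
    "finnish": "fin",
    "russian": "rus",
}


def build_ocr_command(image_path, output_base, source_language):
    """Build a Tesseract command that uses the dictionary of the source language."""
    code = TESSERACT_CODES[source_language.lower()]
    # Form: tesseract <image> <output base> -l <language code>
    return ["tesseract", image_path, output_base, "-l", code]


command = build_ocr_command("page1.png", "page1", "French")
print(" ".join(command))  # prints: tesseract page1.png page1 -l fra
```

If Tesseract is installed, the list can be handed to `subprocess.run(command, check=True)`, which writes the recognised text to `page1.txt`. Picking the wrong language usually shows up as misread accented characters.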


 
Heinrich Pesch  Identity Verified
Finland
Local time: 07:16
Member (2003)
Finnish to German
Java run-time error Dec 4, 2012

Dominique Pivard wrote:

Tony M wrote:
Gerard de Noord wrote:
You can give Wordfast Anywhere a try.

I just did, and I have to say I was so not impressed.
It couldn't produce anything at all from an image-based PDF file (which I think is what Cristina was asking about), and even from a nice, clean DOC > PDF converted file, its OCR results were, to say the least, unusable!

Did you create a dummy TM in which the source language matched the language of the PDF you wanted to OCR? This allows the OCR engine to use a dictionary for that language, when it encounters words that cannot be read easily.
Can you post the image-based PDF for which you obtained unusable results?

You may want to have a look at this:

http://wordfast.fi/blog/cat-tools/2011/10/12/converting-a-dead-pdf-to-word-with-wordfast-anywhere/


I tried this out with a PDF that I had converted easily with FineReader and translated in WFC, but after the upload WFA reported a Java run-time error.
So I have to take back my "impossible" above, but a PDF converter built into a translation environment tool is only as good as its weakest part.


 
Dominique Pivard  Identity Verified
Local time: 07:16
Finnish to French
Problem reported to the WFA list Dec 4, 2012

Heinrich Pesch wrote:
I tried this out with a PDF that I had converted easily with FineReader and translated in WFC, but after the upload WFA reported a Java run-time error.

I confirm there's a problem with the server right now. I reported it to the WFA list. Hopefully it will be fixed sooner rather than later.


 
Yasmin Moslem  Identity Verified
Egypt
Local time: 06:16
English to Arabic
FineReader OCR Online Dec 4, 2012

Dear Colleagues,

You can use FineReader OCR Online:
http://finereader.abbyyonline.com/

Convert the file and then translate it in the tool of your choice.

HTH,
Yasmin


 
Tom45 (X)
Local time: 06:16
Scanned images? Dec 10, 2012

cristina villanueva wrote:

Does anyone know a CAT tool which can be used to translate scanned documents? Trados does not allow it.

Thanks a lot
Cristina


Is this what you need: translating an image?

http://tinyurl.com/d8qamwe


 
AllegroTrans  Identity Verified
United Kingdom
Local time: 05:16
Member (2011)
French to English
FINEREADER ONLINE Jan 10, 2013

I tried this service once but found there is very little control over the process. It failed to reproduce diagrams properly.
I invested in ABBYY FineReader (it cost me less than 70 euros) and it is well worth it.


 
Sarah McDowell  Identity Verified
Canada
Local time: 23:16
Member (2012)
Russian to English
ABBYY FineReader Jan 10, 2013

AllegroTrans wrote:

I tried this service once but found there is very little control over the process. It failed to reproduce diagrams properly.
I invested in ABBYY FineReader (it cost me less than 70 euros) and it is well worth it.


Where can you find this for less than 70 euros? I have only seen this available for much higher prices.


 