TM from previously translated documents
Thread poster: Hoshi-Trans
Hoshi-Trans
Hoshi-Trans
United Kingdom
Local time: 04:38
English to Japanese
Mar 12, 2011

Hi,

is it possible to generate translation memory automatically from documents that had been previously translated in Word or Excel? (Source and target available)

Or is it necessary to create a new project, import source and paste the translation one by one?

Thanks in advance for any adivce


 
Didier Briel
Didier Briel  Identity Verified
France
Local time: 05:38
English to French
+ ...
What you need is an aligner Mar 12, 2011

Hoshi-Trans wrote:
is it possible to generate translation memory automatically from documents that had been previously translated in Word or Excel? (Source and target available)

The process of creating a translation memory from existing translations is called an alignment, which is done with an aligner.

There are several open-source aligners.
See Other OmegaT resources and
Third-party software on OmegaT's website.

For Excel, it's generally quite easy to create a translation memory, since there is a one to one match between the source and target document.


Or is it necessary to create a new project, import source and paste the translation one by one?

You could do it that way, but you should be faster with an aligner.

Didier


 
Miloš Švantner
Miloš Švantner  Identity Verified
Slovakia
Local time: 05:38
German to Slovak
+ ...
LF Aligner Mar 12, 2011

I can recommend you LF Aligner, works on Windows, Mac and Linux as well.
It's really fast and accurate.
All you need to know you'll find in the LF_aligner_readme.


 
Samuel Murray
Samuel Murray  Identity Verified
Netherlands
Local time: 05:38
Member (2006)
English to Afrikaans
+ ...
Use an aligner Mar 12, 2011

Hoshi-Trans wrote:
Is it possible to generate translation memory automatically from documents that had been previously translated in Word or Excel?


Use an aligner, such as: http://bligner.org/


 
Same situation but different Jun 29, 2011

I am in the same boat except I've already got everything aligned perfectly. I've got thousands of segments sitting uselessly in Excel. I've tried what must be a dozen different software packages to convert the files but they inevitably turn my lovely Arabic-scripted Dari into complete gibberish despite supposedly being encoded in UTF-8. Any ideas?

 
FarkasAndras
FarkasAndras  Identity Verified
Local time: 05:38
English to Hungarian
+ ...
TMX maker Jun 30, 2011

Just generate a TMX from your data.
This is how I would do it:
Put the data in columns A and B of an xls. Put metadata (e.g. a reference to the source of the text) in column C. Each line of text can have different metadata. Copy-paste it all into a txt file. Save the txt file in UTF-8 encoding with Save as.
Use the TMX maker in the LF Aligner package to generate a TMX. This conserves the metadata as a text field.

Ar
... See more
Just generate a TMX from your data.
This is how I would do it:
Put the data in columns A and B of an xls. Put metadata (e.g. a reference to the source of the text) in column C. Each line of text can have different metadata. Copy-paste it all into a txt file. Save the txt file in UTF-8 encoding with Save as.
Use the TMX maker in the LF Aligner package to generate a TMX. This conserves the metadata as a text field.

Article:

http://www.proz.com/translation-articles/articles/3194/1/Converting-translation-memories-into-spreadsheets-and-vice-versa

Download: www.sourceforge.net/projects/aligner

As long as you save your txt as UTF-8, the Arabic characters should be fine in the TMX. The TMX will be encoded in UTF-8, and the text will essentially just be copied over to the TMX without modification. You can check this by opening the TMX in a text editor. Of course your problem might be in the program you're importing the TMX into.
Collapse


 


There is no moderator assigned specifically to this forum.
To report site rules violations or get help, please contact site staff »


TM from previously translated documents






Wordfast Pro
Translation Memory Software for Any Platform

Exclusive discount for ProZ.com users! Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value

Buy now! »
Anycount & Translation Office 3000
Translation Office 3000

Translation Office 3000 is an advanced accounting tool for freelance translators and small agencies. TO3000 easily and seamlessly integrates with the business life of professional freelance translators.

More info »