Does OmegaT creates search indexes for tm?
Thread poster: Руслан И.

Руслан И.
Russian Federation
Local time: 12:50
Turkish to Russian
+ ...
Nov 29, 2018

When I import new TM into OmegaT in form of large tmx files, OmegaT seems to lag for some time.
Question is, does OmegaT creates search indexes?

Second question, is it better to split large tmx into smaller ones?

My main issue is low OmegaT performance with large tm files.


 

Didier Briel  Identity Verified
France
Local time: 08:50
English to French
+ ...
No search index Nov 29, 2018

Руслан И. wrote:
When I import new TM into OmegaT

OmegaT doesn't import them, it just loads them (i.e., it doesn't transform them into anything else).

in form of large tmx files, OmegaT seems to lag for some time.
Question is, does OmegaT creates search indexes?

No.
The lag time you are seeing is the time needed to load the TMX, or an issue with memory.

Second question, is it better to split large tmx into smaller ones?

It shouldn't really make a difference.

My main issue is low OmegaT performance with large tm files.

Are you sure you have allocated enough memory to OmegaT?

You can check the memory available in Help > About.

If not, you are likely to experience performance issues.

To increase memory, see:
https://omegat.sourceforge.io/manual-standard/en/chapter.installing.and.running.html#launch.command.arguments

Didier


 

Руслан И.
Russian Federation
Local time: 12:50
Turkish to Russian
+ ...
TOPIC STARTER
Solved Nov 29, 2018

Thank you for your reply! Very clear answer!

 

tcordonniery
France
Local time: 08:50
Lucene Indexes for OmegaT Jan 22

Hi Ruslan, sorry that I did not see this message before

I experimented the possibility to create indexed translation memories for OmegaT using Apache Lucene: this library is already present in OmegaT (used for tokenizers) but I added the possibility to use it for indexation, which is its initial goal.

You can download a patched version of OmegaT here:
http://www.silvestri
... See more
Hi Ruslan, sorry that I did not see this message before

I experimented the possibility to create indexed translation memories for OmegaT using Apache Lucene: this library is already present in OmegaT (used for tokenizers) but I added the possibility to use it for indexation, which is its initial goal.

You can download a patched version of OmegaT here:
http://www.silvestris-lab.org/node/43
and to learn how to use Lucene indexes:
http://www.silvestris-lab.org/node/26

As far as I could test, these indexes have a speed comparable with TMX loaded in memory (i.e. search is sometimes faster, sometimes slower), except that they do not need to be loaded: you can have gigabytes of such files without expanding the memory used by OmegaT. So it is definitively useful for large files but useless for smaller ones.

This is still experimental, use it at your own risk. Don't hesitate to make a report about your tests here.

Note: this cannot be done as a plugin actually because OmegaT does not provide an API for translation memories (they provide API only for machine translation, which is not the same thing), that is why it looks like a fork. But if you are interested, we can try to convince the core team to include it.

Regards
Thomas
Collapse


 


There is no moderator assigned specifically to this forum.
To report site rules violations or get help, please contact site staff »


Does OmegaT creates search indexes for tm?

Advanced search






Wordfast Pro
Translation Memory Software for Any Platform

Exclusive discount for ProZ.com users! Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value

More info »
PerfectIt consistency checker
Faster Checking, Greater Accuracy

PerfectIt helps deliver error-free documents. It improves consistency, ensures quality and helps to enforce style guides. It’s a powerful tool for pro users, and comes with the assurance of a 30-day money back guarantee.

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search