Word Count for EXcel document Different form with SDL
Thread poster: irene yang
irene yang
irene yang
China
Mar 4, 2014

Hi,
I have calculated the words count of an Excel document with SDL 2011 first, the total words count is 147972, and the translator offered the total words is 149703 with using OmegaT. Then I calculated the words count with this formula "=SUM(IF(LEN(TRIM(D2:D26))=0,0,LEN(TRIM(D2:D26))-LEN(SUBSTITUTE(D2:D26," ",""))+1))" in Excel to calculate and got the number 133987.
The difference is so huge, can anyone tell me which one is more accurate?
OmegaT version is 2.6.3_07.
Th
... See more
Hi,
I have calculated the words count of an Excel document with SDL 2011 first, the total words count is 147972, and the translator offered the total words is 149703 with using OmegaT. Then I calculated the words count with this formula "=SUM(IF(LEN(TRIM(D2:D26))=0,0,LEN(TRIM(D2:D26))-LEN(SUBSTITUTE(D2:D26," ",""))+1))" in Excel to calculate and got the number 133987.
The difference is so huge, can anyone tell me which one is more accurate?
OmegaT version is 2.6.3_07.
Thank you!
Collapse


 
Didier Briel
Didier Briel  Identity Verified
France
Local time: 14:52
English to French
+ ...
There is no accurate counting, there are different methods Mar 4, 2014

irene yang wrote:

I have calculated the words count of an Excel document with SDL 2011 first, the total words count is 147972, and the translator offered the total words is 149703 with using OmegaT.


Given the huge numbers, I don't find the difference between SDL 2011 and OmegaT to be large. I'm not sure for SDL 2011 but perhaps, as Trados 2007, SDL 2011 does not count numbers as words, which could explain the main difference.

OmegaT count "tokens", i.e., it counts as words what is considered a word by Java (in version 2.6).


Then I calculated the words count with this formula "=SUM(IF(LEN(TRIM(D2:D26))=0,0,LEN(TRIM(D2:D26))-LEN(SUBSTITUTE(D2:D26," ",""))+1))" in Excel to calculate and got the number 133987.
The difference is so huge, can anyone tell me which one is more accurate?
OmegaT version is 2.6.3_07.

If I understand correctly, you count words by counting the number of spaces between the words. Depending on the text and on the source language, your counting method may or may not be accurate. (And I'm no Excel specialist, so I cannot tell just by reading your formula whether it works well or not.)
To get a reasonable idea, I would copy/paste your Excel text into Word, and compare with the count in Word. Generally, OmegaT and Word counts are close.

Didier


 


There is no moderator assigned specifically to this forum.
To report site rules violations or get help, please contact site staff »


Word Count for EXcel document Different form with SDL






TM-Town
Manage your TMs and Terms ... and boost your translation business

Are you ready for something fresh in the industry? TM-Town is a unique new site for you -- the freelance translator -- to store, manage and share translation memories (TMs) and glossaries...and potentially meet new clients on the basis of your prior work.

More info »
Protemos translation business management system
Create your account in minutes, and start working! 3-month trial for agencies, and free for freelancers!

The system lets you keep client/vendor database, with contacts and rates, manage projects and assign jobs to vendors, issue invoices, track payments, store and manage project files, generate business reports on turnover profit per client/manager etc.

More info »