F-Measure in BuildAnalytics


What is a good F-measure?

F-Measure scores range from 0% to 100%. A score below 15% means that your KantanMT engine is not performing optimally, and a high level of post-editing will be required to finalise your translations and reach publishable quality.

A score greater than 70% is good, and significantly less post-editing will be required to achieve publishable translation quality.
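To make the thresholds above more concrete, here is a minimal sketch of how a word-level F-measure could be computed for one segment and then read against the 15% and 70% marks. It assumes the common definition of F-measure as the harmonic mean of precision and recall over matching words. KantanMT's exact calculation is not described here, so the function names `f_measure` and `interpret` and the whitespace tokenisation are illustrative assumptions only.

```python
from collections import Counter


def f_measure(candidate: str, reference: str) -> float:
    """Word-level F-measure as a percentage (illustrative, not KantanMT's exact method)."""
    cand_tokens = candidate.lower().split()
    ref_tokens = reference.lower().split()
    if not cand_tokens or not ref_tokens:
        return 0.0
    # Clipped overlap: a word counts as matched at most as often as it
    # appears in both the engine output and the reference.
    overlap = sum((Counter(cand_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(cand_tokens)  # share of output words that are correct
    recall = overlap / len(ref_tokens)      # share of reference words that were produced
    return 100.0 * 2 * precision * recall / (precision + recall)


def interpret(score: float) -> str:
    """Map a score to the guidance in this article (assumed labels)."""
    if score < 15:
        return "high level of post-editing expected"
    if score > 70:
        return "good; significantly less post-editing expected"
    return "between the two thresholds"


score = f_measure("the cat sat on the mat", "the cat sat on a mat")
print(f"{score:.1f}% -> {interpret(score)}")
```

In this sketch the score rewards output that shares many words with the reference translation, which is why segments that match their reference closely land above the 70% mark.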

Improving F-Measure Score

There is a high correlation between the number of words used in training a KantanMT engine and its F-Measure score. Put simply, the more training data you upload to KantanMT, the better the F-Measure score and, consequently, the generated translations. You can also try uploading more terminology or using GapAnalysis to expand the vocabulary of the engine.



Using F-measure in BuildAnalytics

To see the accuracy and performance of your engine, click on the ‘F-measure Scores’ tab in BuildAnalytics. You will be directed to the ‘F-measure Scores’ page.


Place your cursor on the ‘F-measure Scores Chart’ to see the individual score of each segment. A pop-up will appear with details of the segment under these headings: ‘Segment no.’, ‘Score’, ‘Source’, ‘Reference/Target’ and ‘KantanMT Output’.



To see the ‘F-measure Scores’ of each segment in table format, scroll down. You will see a table with the headings ‘No’, ‘Source’, ‘Reference/Target’, ‘KantanMT Output’ and ‘Score’.

To see a more in-depth breakdown of a particular segment, click on the triangle beside the number of the segment you wish to view.


To reuse the engine as Test Data, click on the ‘Reuse as Test Data’ button. When you do so, the ‘Reuse as Test Data’ button will change to ‘Delete Test Data’.


To download the ‘F-measure Scores’ of all segments, click on the ‘Download’ button on the ‘F-measure Scores’ page.


