Monday, July 16, 2012

Most of the rules can be successfully converted

All this time I have been working with the English rules of LanguageTool. You can find my script here and obtain the XML files here. The script currently produces the following output:
67.471 % of rules covered (587/870)
23 rules left to cover
260 unsupported rules
When you run the script, you get a text file named unsupported.txt that lists all unsupported rules. Most of them are "postag" rules. They are not easy to convert to LightProof, so supporting them requires extending the LightProof API. I am planning to hack on that after the midterm evaluation. You can find more about the LanguageTool XML API here. A small number of corrections are also still needed for things like the scope, skip, and match attributes and tags, but that part is almost done.
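To make the "postag" problem a bit more concrete, here is a minimal sketch of how rules could be split into convertible and unsupported ones. It is not my actual script: the names split_rules, summary and SAMPLE are made up for illustration, the sample XML is a tiny hand-written fragment in the style of a LanguageTool grammar file, and I simply assume that any rule containing a postag attribute counts as unsupported.

```python
import xml.etree.ElementTree as ET

# Hand-written sample in the style of a LanguageTool grammar file
# (hypothetical rules, not taken from the real English rule set).
SAMPLE = """<rules lang="en">
  <category name="Sample">
    <rule id="PLAIN_RULE" name="plain tokens">
      <pattern><token>foo</token><token>bar</token></pattern>
      <message>Plain rule.</message>
    </rule>
    <rule id="POSTAG_RULE" name="uses postag">
      <pattern><token postag="NN"/><token>bar</token></pattern>
      <message>Needs part-of-speech data.</message>
    </rule>
  </category>
</rules>"""


def split_rules(xml_text):
    """Return (convertible_ids, unsupported_ids).

    Assumption: a rule is unsupported if any element inside it
    carries a postag attribute. Rules nested in a rulegroup may
    have no id of their own, in which case None is recorded.
    """
    root = ET.fromstring(xml_text)
    convertible, unsupported = [], []
    for rule in root.iter("rule"):
        uses_postag = any("postag" in el.attrib for el in rule.iter())
        (unsupported if uses_postag else convertible).append(rule.get("id"))
    return convertible, unsupported


def summary(covered, unsupported, total):
    """Format the three summary lines from the rule counts."""
    left = total - covered - unsupported
    return "\n".join([
        "%.3f %% of rules covered (%d/%d)"
        % (100.0 * covered / total, covered, total),
        "%d rules left to cover" % left,
        "%d unsupported rules" % unsupported,
    ])


if __name__ == "__main__":
    print(split_rules(SAMPLE))
    print(summary(587, 260, 870))
```

The summary helper only redoes the arithmetic behind the numbers above: rules left to cover are whatever remains after subtracting the covered and unsupported rules from the total.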

To be continued.
