Get Involved

Mailing List

The main forum for communication on Moses is the Moses support mailing list.

Suggestions

We'd like to hear what you want from Moses. We can't promise to implement the suggestions, but they can be used as input into research and student projects, as well as Marathon projects. If you have a suggestion/wish for a new feature or improvement, then either report them via the issue tracker, contact the mailing list or drop Barry or Hieu a line (addresses on the mailing list page).

Development

Moses is an open source project that is at home in the academic research community. There are several venues where this community gathers, such as:

Moses is being developed as a reference implementation of state-of-the-art methods in statistical machine translation. Extending this implementation may be the subject of undergraduate or graduate theses, or class projects. Typically, developers extend functionality that they required for their projects, or to explore novel methods. Let us know if you made an improvement, no matter how minor. Also let us know if you found or fixed a bug.

Use

We are aware of many commercial deployments of Moses, for instance as described by TAUS. Please let us know if you use Moses commercially. Do not hesitate to contact the core developers of Moses. They are willing to answer questions and may be even available for consulting services.

Contribute

There are many ways you can contribute to Moses.

Projects

If you are looking for projects to improve Moses, please consider the following list:

Front-end Projects

Training & Tuning

https://www-i6.informatik.rwth-aachen.de/publications/download/668/Wuebker-ACL-2010.pdf

Chart-based Translation

http://www.dfki.de/~davi01/papers/vilar11:search.pdf
This project proposes to quantify the advantages and disadvantages of scope-0 grammar. A scope-0 grammar lacks application ambiguity, therefore, decoding can be fast and memory efficient. However, this must be offset against potential translation quality degradation due to the lack of coverage.
It may be that the advantages of a scope-0 grammar can only be realized through specifically developed algorithms, such as parsing algorithms or data structures. The phrase-table lookup for a Scope-0 grammar can be significantly simplified, made faster, and applied to much large span width.
This project will also aim to explore this potentially rich research area.
http://aclweb.org/anthology//W/W12/W12-3150.pdf
http://www.sdl.com/Images/emnlp2009_tcm10-26628.pdf

Phrase-based Translation

More ideas related to reordering:
http://www.spencegreen.com/pubs/green+galley+manning.naacl10.pdf
http://www.transacl.org/wp-content/uploads/2013/07/paper327.pdf
https://www-i6.informatik.rwth-aachen.de/publications/download/444/Zens-COLING-2004.pdf
https://www-i6.informatik.rwth-aachen.de/publications/download/896/Feng-ACL-2013.pdf
http://research.google.com/pubs/archive/36484.pdf
http://research.google.com/pubs/archive/37163.pdf
http://research.google.com/pubs/archive/41651.pdf
http://research.google.com/pubs/archive/38279.pdf
  1. A losing hypothesis which have been recombined with the winning hypothesis may now be the new winning hypothesis. The output search graph has to be reordered to reflect this.
  2. The feature functions in the 2nd pass produce state information. Recombined hypotheses may no longer be recombined and have to be split.
  3. It would be useful for feature functions scores to be able to be evaluated asynchronously. That is, a function to calculate the score it called but the score is calculated later. Skills required - C++, NLP, Moses. (GSOC)

General Framework & Tools

Engineering Improvements

Documentation