Joshua currently implements a suffix array grammar extraction framework (based on Lopez, 2008) similar to that present in Hiero. There are a number of speed enhancements described in Lopez (2008) that are not currently implemented in Joshua. This project would involve enhancing existing Joshua code to speed up the grammar extraction process.