The NCCR Evolving Language offers the following datasets as open-source downloads:

