• @Kusimulkku@lemm.ee
    link
    fedilink
    926 days ago

    Mozilla’s Common Voice seems pretty cool, but I’m not sure if that counts.

    It’s fun to record the clips.

    • ArchRecord
      link
      fedilink
      English
      225 days ago

      I’ve contributed to labeling and scoring some of the Common Voice data before. Definitely a fun little thing to do when you have some free time.

      I was also pretty happy when I saw Open Assistant making a fully public, consensually contributed to database for text models, but they unfortunately shut down, and in the end there was only really enough data to fine-tune models rather than creating one from scratch.