#voiceai

Now that my #wake_word_detection #research has borne fruit, I plan to continue working in the voice domain. I would love to train a #TTS model with a #British accent that I could use for practice.

I was wondering whether I could run inference on the #A311D #NPU. However, from skimming papers on different models, getting reasonable inference performance on the A311D seems unlikely. Even training these models on my entry-level #IntelArc #GPU would be painful.

Maybe I could just fine-tune an existing model. I am also thinking about using #GeneticProgramming for some components of these TTS models to see whether it improves inference performance.

#FastSpeech2 and #SpeedySpeech look promising. I wonder how natural their accents will sound, but they would be good starting points.

BTW, if anyone needs open-source models, I would love to work as a freelancer and have an #opensource job. Even if someone could just provide access to compute resources, that would help.

#forhire #opensourcejob #job #hiring

The article gushes over HiTTS.cc, a contraption that turns text into "humanlike" voices using GPT-4o, but let's be honest, it's more like Alexa's awkward cousin attempting Shakespeare 🎭. At least you can spend your time picking from a buffet of cringe-inducing voice names like "Echo" and "Onyx" 🤖. Enjoy your journey into the uncanny valley, brave soul! 🚀
hitts.cc #HiTTScc #GPT4o #texttospeech #uncannyvalley #voiceAI #cringe #HackerNews #ngated

hitts.cc
HiTTS.cc - Advanced Text to Speech with GPT-4o mini TTS
Experience revolutionary Text to Speech technology powered by OpenAI's GPT-4o mini TTS model.