Similar use cases to what I’m doing right now, running LLMs like Mixtral8x7B (or something better by the time we start seeing these), Whisper (STT), or Stable Diffusion.
I use a fine tuned version of Mixtral (dolphin-Mixtral) for coding purposes.
Transcribing live audio for notes/search, or translating audio from different languages using Whisper (especially useful for verifying claims of translations for Russian/Ukrainian/Hebrew/Arabic especially with all of the fake information being thrown around).
Combine the 2 models above with a text to speech system (TTS), a vision model like LLaVA and some animatronics and then I’ll have my own personal GLaDOS:
https://github.com/dnhkng/GlaDOS
And then there’s Stable Diffusion for generating images for DnD recaps, concept art, or even just avatar images.
Removed by mod
Similar use cases to what I’m doing right now, running LLMs like Mixtral8x7B (or something better by the time we start seeing these), Whisper (STT), or Stable Diffusion.
I use a fine tuned version of Mixtral (dolphin-Mixtral) for coding purposes.
Transcribing live audio for notes/search, or translating audio from different languages using Whisper (especially useful for verifying claims of translations for Russian/Ukrainian/Hebrew/Arabic especially with all of the fake information being thrown around).
Combine the 2 models above with a text to speech system (TTS), a vision model like LLaVA and some animatronics and then I’ll have my own personal GLaDOS: https://github.com/dnhkng/GlaDOS
And then there’s Stable Diffusion for generating images for DnD recaps, concept art, or even just avatar images.
Removed by mod