bleetube on Nostr: https://bitcoiner.social/static/attachments/4rG1Kg_tony--d6_trim.mp4 I had way too ...
I had way too much fun with LivePortrait this morning. Couldn't quite get it to run on Nix (I still suck at Nix derivations), but they're still running a free demo Space on HF.
Also out of China in the last few weeks were ChatTTS and Qwen2, state-of-the-art local text-to-speech and LLM models. I've been running Qwen2 with ollama and open-webui, and it's not half bad compared to 4o or Claude; in one prompt it actually gave me a better answer.
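For anyone who wants to poke at a local model without open-webui, here's a rough sketch of hitting ollama's HTTP API from Python. It assumes ollama is serving on its default port (11434) and that the model was pulled under the tag `qwen2`; the helper names `build_request` and `ask` are made up for illustration.

```python
import json
import urllib.request

# Assumption: ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt: str, model: str = "qwen2") -> urllib.request.Request:
    """Build a non-streaming generate request (model tag is an assumption)."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(prompt: str, model: str = "qwen2") -> str:
    """Send the prompt to the local server and return the generated text."""
    with urllib.request.urlopen(build_request(prompt, model), timeout=120) as resp:
        return json.loads(resp.read())["response"]

# Usage, with the server running:
#   print(ask("Explain what a Nix derivation is in two sentences."))
```

With `"stream": False` the server sends back a single JSON object, and the generated text sits in its `response` field, so there's no chunk handling to worry about.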
There's a ComfyUI node for LivePortrait. There's a lot you could do with this stuff. Make deepfakes and destroy democracy, sure. But more relevant to my interests: we're nearly upon the toolset needed to build a fully artificial (and local) talking head. If there's a model that can generate facial expressions to match a voice, I think basically everything we need is right there. Then use ollama to RAG your own documents and legit talk to your home PC face to face.
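To make the RAG part concrete, here's a toy sketch of the retrieval half: score your documents against a question, keep the best matches, and stuff them into a prompt for the local model. It's a stand-in under loud assumptions: a real setup would use an embedding model rather than plain word counts, and `tokenize`, `retrieve`, and `build_prompt` are hypothetical helper names, not part of ollama or any library.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question: str, docs: list[str], k: int = 2) -> list[str]:
    """Return the k documents most similar to the question.

    Assumption: word counts stand in for real embeddings here.
    """
    q = Counter(tokenize(question))
    ranked = sorted(docs, key=lambda d: cosine(q, Counter(tokenize(d))), reverse=True)
    return ranked[:k]


def build_prompt(question: str, docs: list[str], k: int = 2) -> str:
    """Put the retrieved documents ahead of the question as context."""
    context = "\n".join(f"- {d}" for d in retrieve(question, docs, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"
```

The string from `build_prompt` is what you'd hand to the local model. Swapping the word-count scoring for embeddings changes `retrieve` but not the overall shape.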
{
"id":"1ba919565e72bcf563fb434916448f9dd118550daa75ec779c01e1b8ef0e802d",
"pubkey":"69a0a0910b49a1dbfbc4e4f10df22b5806af5403a228267638f2e908c968228d",
"created_at":1721244984,
"kind":1,
"tags": [
[
"r",
"https://bitcoiner.social/static/attachments/4rG1Kg_tony--d6_trim.mp4"
]
],
"content":"https://bitcoiner.social/static/attachments/4rG1Kg_tony--d6_trim.mp4\n\nI had way too much with LivePortrait this morning. Couldn't quite get it to run on Nix (I still suck at nix derivations), but they're still running a free demo space on HF.\n\nAlso out of China in the last few weeks was ChatTTS and Qwen2, state of the art local text to speech and llm. I've been running it with ollama and open-webui, and it's not half bad compared to 4o or claude, in one prompt it actually gave me a better answer.\n\nThere's a ComfyUI node for LivePortrait. There's a lot you could do with this stuff. Deepfakes and destroy democracy, sure. But maybe more relevant to my interests is we're nearly upon the toolset needed to build a fully artificial (and local) talking head. if there's a model that can generate facial expressions to match a voice, I think that's basically everything we need is right there. then use ollama to RAG your own documents and legit talk to your home PC face to face.\n\nwhat a time to be alive",
"sig":"755eada76fa5b2dfccba3b853a26979b3aab07436043c7608849e895b21170f08844344e0abcd79e1c563e317525149519aabed5b98959b948c2273e5837b8b1"
}