Tagged with

4 articles found

Microsoft's VibeVoice 1.5B Can Generate 90-Minute Podcasts With 4 Voices

Microsoft's new open-source TTS model can synthesize feature-length audio with multiple speakers, but comes with audible disclaimers and watermarking to prevent misuse.

#text-to-speech #microsoft #ai...

text-to-speech

VibeVoice's Uncanny Valley: Microsoft's 90-Minute AI Podcasts Sound Too Human

Microsoft's VibeVoice model can generate 90-minute multi-speaker podcasts that blur the line between synthetic and human speech, raising ethical questions about audio deepfakes.

#text-to-speech #microsoft #ai...

CPU-First AI: BitDistill Enables High-Performance LLMs Without GPUs

With 2.65x faster CPU inference, BitDistill signals a potential shift toward CPU-efficient AI deployment, reducing reliance on expensive GPU infrastructure.

#ai #llm #cpu-inference...

Why Your AI Assistant Needs a Bad Attitude

Microsoft's UserLM-8b flips the script by training AI to think like messy, inconsistent humans instead of perfect assistants.

#AI #UserLM #Microsoft...
