A few years ago, a new kind of AI called a diffusion model appeared. Today, it powers tools like Stable Diffusion and Runway Gen-2, turning text prompts into high-quality images and even short videos.
We’re all pretty familiar with AI’s ability to create realistic-looking images of people that don’t exist, but here’s an unusual implementation of using that technology for a different purpose: ...
Alibaba’s EMO (or Emote Portrait Alive) framework is a recent entry in a series of attempts to generate a talking head using existing audio (spoken word or vocal audio) and a reference portrait image ...
As LLMs and diffusion models power more applications, their safety alignment becomes critical. Our research shows that even minimal downstream fine‑tuning can weaken safeguards, raising a key question ...
Stability AI has announced Stable Diffusion 3, the latest and most powerful version of the company’s image-generating AI model. While details are scant, it’s clearly an attempt to fend off the hype ...
With so much money flooding into AI startups, it’s a good time to be an AI researcher with an idea to test out. And if the idea is novel enough, it might be easier to get the resources you need as an ...