黑馬殺出！淘天視頻模型登頂第一！

最近AI視頻圈直接被一匹黑馬攪得天翻地覆。

今天，一個叫 HappyHorse-1.0 的模型突然出現在 Artificial Analysis Video Arena盲測榜單上。

結果當天就直接包攬了 Text-to-Video、Image-to-Video 兩個榜單的第一名，而且不管帶不帶音頻，它都是穩穩的第一（昨晚看還是第二來着）。

這裏我們先簡單說說這個評測平臺。

Artificial Analysis Video Arena 是目前公認的權威的AI視頻第三方盲測榜單，完全匿名投票，只看最終生成的視頻質量，不看參數、不看公司背景。

能在這裏全榜第一，足見其含金量。

HappyHorse-1.0 的出現，讓人忍不住感嘆AI視頻技術的進步速度又加快了。

確實，前段時間大家還驚豔於 Seedance 2.0 出色的畫面效果和鏡頭控制能力，感嘆其估計一段時間內沒對手。

結果阿里淘天，突然殺出這麼一匹黑馬，直接把所有榜單都拿下了。

目前，Artificial Analysis 也放出了一些 demo，並將其與 Seedance 2.0、可靈 3.0 Pro、Grok-video-imagine 以及 PixVerse V6 拉出來一起對比測試：

此部分移步我們寫的原文觀看

Prompt：A hula hoop spinning on a kids waist, gradually climbing to their chest, then dropping to knees, then clattering to the floor. They pick it up to try again.

Prompt：A golf ball in a cup rolling around the rim three times before finally dropping in. The golfer's body language matches each rotation. Audio: Ball rattle, exhale, plop.

Prompt：A cat staring at its own reflection in a toaster, paw tapping the chrome surface. The distorted cat reflection taps back. Audio: Paw taps, confused meow.

Prompt：A barista creating latte art by pouring steamed milk into espresso. The white milk submerges beneath the brown crema initially, then breaks through the surface as the cup fills. The barista's wrist makes precise oscillating movements, creating a rosetta pattern. The milk and espresso maintain their distinct colors while interacting at the boundary. Audio: The gentle pour of liquid, the hiss of the steam wand in the background.

從 demo 就可以看出，在畫面一致性、運動自然度、物理真實性、提示詞遵循度這些核心維度上，幾乎是全面領先的。

且更難得的是，不管是純文本生成視頻，還是帶圖像參考生成視頻，甚至需要同步輸出音頻，它的表現都保持在第一梯隊。

這在目前公開的視頻生成模型裏，屬於非常均衡且頂尖的水準。

當然，於我個人而言，除了上述的這些維度，其實還有一個很重要的點——美學，就目前來看，我認爲 HappyHorse 在美學上的水平跟其他幾家相比還是差了點意思，特別是高爾夫和呼啦圈這兩個 demo，雖然指令遵循得很好，但是在觀感上和 Seedance2.0 以及 PixVerse V6 比還是差了點意思，缺少故事感。