Method Overview
ChuniWorld is built around four core properties: agency, persistence, durability, and responsiveness.
Agency
Two control channels: a rendered 3D cache with lightweight AdaLN camera modulation for grounded, trajectory-aware navigation, and chunk-level prompt switching to introduce new events mid-generation.
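As a rough illustration of what AdaLN-style camera modulation can look like, the sketch below normalizes each token and then applies a scale and shift predicted from a camera embedding. The function name, the linear projections `w_scale` and `w_shift`, and all shapes are assumptions made for this example; the page does not specify how ChuniWorld implements its modulation.

```python
import numpy as np

def adaln_modulate(x, cam, w_scale, w_shift, eps=1e-5):
    """Adaptive layer norm conditioned on a camera embedding (illustrative only).

    x:       (tokens, dim) feature tokens
    cam:     (cam_dim,) camera pose embedding (hypothetical encoding)
    w_scale: (cam_dim, dim) projection producing the per-channel scale
    w_shift: (cam_dim, dim) projection producing the per-channel shift
    """
    # Normalize each token over its channel dimension, without a learned affine.
    mean = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    x_norm = (x - mean) / np.sqrt(var + eps)

    # Predict the modulation parameters from the camera embedding.
    scale = cam @ w_scale
    shift = cam @ w_shift

    # With zero projections this reduces to a plain layer norm.
    return x_norm * (1.0 + scale) + shift
```

Initializing the projections to zero is a common choice for this kind of conditioning, since the block then starts out as an unconditioned layer norm and the camera signal is learned gradually.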
Persistence
World-consistent memory — an explicit 3D cache reprojected to the queried view for spatial recall, plus a compressed frame-history embedding for temporal continuity, so revisited places stay recognizable.
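Reprojecting a 3D cache to a queried view can be sketched with a standard pinhole camera model. The example below assumes the cache is a plain array of world-space points and that the queried view is given by intrinsics `K` and a world-to-camera pose `(R, t)`; these names and the point-cloud representation are assumptions for illustration, not details taken from the page.

```python
import numpy as np

def reproject_cache(points_world, K, R, t, width, height):
    """Project cached 3D points into a queried camera view (pinhole model).

    points_world: (N, 3) cached points in world coordinates
    K:            (3, 3) camera intrinsics
    R, t:         world-to-camera rotation (3, 3) and translation (3,)
    Returns pixel coordinates (M, 2) and depths (M,) of the visible points.
    """
    # Transform the points into the camera frame.
    pts_cam = points_world @ R.T + t

    # Discard points behind (or on) the camera plane.
    pts_cam = pts_cam[pts_cam[:, 2] > 1e-6]

    # Perspective projection to pixel coordinates.
    uvw = pts_cam @ K.T
    uv = uvw[:, :2] / uvw[:, 2:3]

    # Keep only points that land inside the image bounds.
    inside = (
        (uv[:, 0] >= 0) & (uv[:, 0] < width)
        & (uv[:, 1] >= 0) & (uv[:, 1] < height)
    )
    return uv[inside], pts_cam[inside, 2]
```

A full renderer would also resolve occlusions, for example with a depth buffer, before the reprojected view is used as conditioning.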
Durability
Long-horizon stability from training on drifted histories and an error bank that re-injects accumulated artifacts into both memory and target, preventing errors from compounding over minute-long rollouts.
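One way to picture an error bank is as a bounded buffer of residuals between rolled-out predictions and clean references, sampled later to perturb training inputs. The sketch below is a minimal interpretation under that assumption: the class name, the FIFO capacity, the uniform sampling, and the choice to add one sampled residual to both memory and target are all hypothetical, and it further assumes memory and target share the residual's shape.

```python
import numpy as np
from collections import deque

class ErrorBank:
    """Bounded store of rollout residuals (illustrative, not the authors' design)."""

    def __init__(self, capacity=1024, seed=0):
        # Oldest residuals are dropped once capacity is reached.
        self.bank = deque(maxlen=capacity)
        self.rng = np.random.default_rng(seed)

    def add(self, predicted, clean):
        # Record the artifact the model accumulated relative to the clean signal.
        self.bank.append(predicted - clean)

    def perturb(self, memory, target, strength=1.0):
        # With nothing stored yet, training proceeds on clean data.
        if not self.bank:
            return memory, target
        # Sample one stored residual and inject it into both memory and target.
        err = self.bank[self.rng.integers(len(self.bank))]
        return memory + strength * err, target + strength * err
```

The intent of such a scheme is that the model sees drifted histories during training that resemble what it will encounter at inference time during long rollouts.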
Responsiveness
Real-time interaction via few-step DMD distillation and short temporal chunks, with prompt switching at chunk boundaries to minimize both visual and semantic latency.
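The link between chunk length and semantic latency can be illustrated with a small scheduling sketch: a prompt requested mid-chunk only takes effect at the next chunk boundary, so shorter chunks shorten the wait. The function name, the frame-indexed request format, and the scheduling rule are assumptions for this example rather than the system's actual interface.

```python
def prompts_per_chunk(total_frames, chunk_size, initial_prompt, requests):
    """Return the active prompt for each chunk (illustrative scheduling only).

    requests: list of (frame_index, prompt) pairs. A request takes effect at
    the first chunk that starts at or after its frame index.
    """
    pending = sorted(requests, key=lambda r: r[0])
    active = initial_prompt
    schedule = []
    i = 0
    for start in range(0, total_frames, chunk_size):
        # Apply every request that arrived before this chunk began.
        while i < len(pending) and pending[i][0] <= start:
            active = pending[i][1]
            i += 1
        schedule.append(active)
    return schedule
```

Under this rule a request waits at most `chunk_size - 1` frames before it is applied. For instance, with 4-frame chunks a prompt issued at frame 5 becomes active at frame 8.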
Demo Results
Diverse Style Video Generation
Same scene, seven styles — drag the handles to compare, switch the scene below.
Scenes: Jiangnan Town, Green Village
Styles: Realistic, Oil Painting, Ink Wash, Cyberpunk, Zelda, Minecraft, Pixel
Camera-Controllable Video Generation
Real-time 6-DoF camera control with an on-screen joystick — use the arrows to browse 16 clips.
Game-Style Video Generation
Worlds generated in real-game and synthesized game styles — use the arrows to browse 16 clips.
Prompt-Driven Interaction
Drop a prompt mid-scene to trigger an event — same world, different spells. Use the arrows to browse.
Scene: A misty Jiangnan water town — whitewashed houses and stone bridges mirrored in the canal.
Prompt: Fireworks burst over the rooftops.
Scene: A misty Jiangnan water town — whitewashed houses and stone bridges mirrored in the canal.
Prompt: A red-and-black magic circle erupts across the ground.
Scene: A misty Jiangnan water town — whitewashed houses and stone bridges mirrored in the canal.
Prompt: A giant tree crashes down from the sky.
Scene: A misty Jiangnan water town — whitewashed houses and stone bridges mirrored in the canal.
Prompt: A huge white bear is summoned into the street.
Scene: A sunny green meadow, a dirt path winding toward distant mountains.
Prompt: A fireball detonates in a burst of flame.
Scene: A sunny green meadow, a dirt path winding toward distant mountains.
Prompt: A blazing phoenix is summoned into the sky.
Scene: A sunny green meadow, a dirt path winding toward distant mountains.
Prompt: The whole world darkens into night.
Scene: A sunny green meadow, a dirt path winding toward distant mountains.
Prompt: A towering purple monster is summoned.
Scene: A snowy mountain valley rendered in a soft oil-painting style.
Prompt: A skeleton rises from the snow.
Scene: A sunlit cobblestone avenue leading to the Roman Colosseum.
Prompt: A giant ice hammer slams onto the plaza.
Scene: A traditional Japanese courtyard framed by crimson autumn maples.
Prompt: The buildings crumble and collapse.
Scene: A Japanese shrine with a torii gate under a grey sky.
Prompt: Flames engulf the shrine.
Scene: A dreamlike frozen fantasy world glittering in blue light.
Prompt: A wand wave makes flowers bloom across the ice.
Scene: A colorful Mediterranean game-style town by the sea.
Prompt: A fiery phoenix is summoned overhead.
Scene: A famous white Japanese castle framed by blooming cherry blossoms.
Prompt: A massive explosion erupts.
Scene: Desert pyramids glowing under a golden sunset.
Prompt: Green water floods and spreads across the sand.
World-Consistent Video Generation
Turn left and right, then look back — does the world stay consistent? Same trajectory, four methods.
ChuniWorld (Ours)
DreaxX-World
lingBot-Fast
HY-World 1.5
Long-Horizon Video Generation
Over a minute of continuous, drift-free generation per clip — all shown at 2× speed.
Team
ALAYA Lab
Within each role category, authors are listed in alphabetical order by their first names.
Core Lead:
Kaipeng Zhang
Lead:
Chuanhao Li
Core Contributor:
Chuanhao Li, Kaipeng Zhang, Yifan Zhan, Yongtao Ge, Yuanyang Yin
Contributor:
Jiaming Tan, Kang He, Liaoyuan Fan, Ruicong Liu, Xiaojie Xu, Xuangeng Chu, Zhen Li, Zhengyuan Lin, Zhixiang Wang, Zian Meng, Zihui Gao