This paper introduces a novel framework for a text-to-game engine that leverages foundation models to transform simple textual inputs into intricate, multimodal RPG experiences.
The transition from professionally generated content (PGC) to user-generated content (UGC) has reshaped various media formats, encompassing formats such as text and video. With rapid advancements in generative AI, a similar transformation is set to redefine the gaming industry, particularly within the domain of role-playing games (RPGs).
The engine dynamically generates game narratives, integrating text, visuals, and mechanics, while adapting characters, environments, and gameplay in real time based on player interactions. To evaluate and demonstrate the feasibility and versatility of this framework, we developed the ‘Zagii’ game engine.
Zagii has successfully powered hundreds of RPG games across diverse genres and facilitated tens of thousands of online gameplay sessions, showcasing its scalability and adaptability. These results highlight the framework’s effectiveness and its potential to foster a more open and democratized approach to game development.
Our work underscores the transformative role of generative AI in reshaping the gaming lifecycle and advancing the boundaries of interactive entertainment.