Happy Horse 1.1 Tutorial: Text to Video and Image to Video
Share this post:
Happy Horse 1.1 Tutorial: Text to Video and Image to Video
Happy Horse 1.1 by Alibaba is now available on AI FILMS Studio. It is the updated version of Happy Horse 1.0, with a significantly lower 1080p credit rate. At 1080p, Happy Horse 1.1 costs 945 credits for a 5 second clip compared to 1,400 credits on 1.0, a reduction of roughly a third with no change to quality or capabilities.
What Is Happy Horse 1.1
Happy Horse 1.1 is Alibaba's updated cinematic video generation model. It produces 720p and 1080p video from text prompts or reference images, with fluid motion dynamics, stable subject rendering, and precise prompt adherence.
Text-to-video supports five aspect ratios: 16:9, 9:16, 1:1, 4:3, and 3:4, with clip lengths from 3 to 15 seconds. Image-to-video animates a reference image into a cinematic clip while preserving the composition and identity of the source. Both modes bill at the same per second rate.
Text to Video
Step 1: Open the workspace and select your model
Go to AI FILMS Studio. In the video generator panel, use the Select Generation Type dropdown to choose Text to Video, then open the Select Model dropdown and choose Happy Horse 1.1.

Step 2: Configure your generation
Write your prompt in the text field, then set your resolution, aspect ratio, and duration.
Prompt: Describe the scene, motion, camera movement, and lighting. Up to 2,500 characters. Specific camera instruction produces better results: "slow dolly push toward the subject as fog rolls across the floor" outperforms "camera moves forward".
Resolution: 720p for drafts and iteration, 1080p for final deliverables.
Aspect ratio: 16:9 (landscape), 9:16 (portrait), 1:1 (square), 4:3 (classic), or 3:4 (portrait).
Duration: 3 to 15 seconds. Start with 5 seconds when testing a new prompt.
Seed: Optional. A fixed seed reproduces the same output when all other parameters stay the same. Leave blank for a random result on each run.

Step 3: Generate and view output
Click Create. The video renders and appears in the output panel. Download it directly from the workspace.
Text to Video in the Nodes Graph
Happy Horse 1.1 text-to-video is available as a node in the AI FILMS Studio Nodes Graph Editor. Connect a Prompt node to the Text to Video node set to Happy Horse 1.1, then wire the output to a Video Viewer or Result node. This lets you chain the model into automated pipelines alongside image generation, audio, or enhancement nodes.
Image to Video
Image-to-video animates a reference image into a cinematic clip. The model preserves the composition, lighting, and subject identity of your source image while applying smooth motion and camera movement. The output aspect ratio matches the input image.
Supported input formats: JPEG, PNG, BMP, WEBP. Maximum file size: 10 MB. Minimum dimension: 300 pixels on any side.
Step 1: Switch to image to video mode
In the generation type dropdown, select Image to Video.

Step 2: Select Happy Horse 1.1
Open the model dropdown and choose Happy Horse 1.1.

Step 3: Upload your reference image
Upload your source image. Options include Upload Image, Previous Task, and Image URL.
For best results, use a high contrast image where the subject is clearly defined. Avoid extreme aspect ratios. The model works within a 1:2.5 to 2.5:1 ratio range.

Step 4: Write your prompt (optional)
Describe the motion and camera movement you want. The model already knows what your image looks like. Focus on what changes: "she turns slowly toward the camera, light shifting across her face as wind moves through the curtain behind her." Prompts can be up to 2,500 characters.

Step 5: Choose resolution and duration
Select 720p or 1080p. Choose a duration between 3 and 15 seconds. A 5 second clip covers most motion tests before committing to a longer generation.

Step 6: Set a seed (optional)
Enter a seed number to lock the animation to a specific variation. The same image, prompt, and seed combination produces the same output on repeat runs. Leave blank for a random result.

Step 7: Review the credit cost
The interface shows the exact credit cost for your selected resolution and duration before you submit. Confirm the settings are correct before generating.

Step 8: Generate and view output
Click Create. The animated video appears in the output panel ready to download.
Image to Video in the Nodes Graph
Happy Horse 1.1 image-to-video is also available as a node in the AI FILMS Studio Nodes Graph Editor. Connect an Image Upload or Image from Task node and a Prompt node to the Image to Video node set to Happy Horse 1.1, then wire the output to a Video Viewer or Result node.
Credit Costs
Both text-to-video and image-to-video bill at the same per second rate. Happy Horse 1.1 reduces 1080p costs by roughly a third compared to Happy Horse 1.0 while keeping 720p pricing identical.
| Resolution | 3 seconds | 5 seconds | 10 seconds | 15 seconds |
|---|---|---|---|---|
| 720p | 420 credits | 700 credits | 1,400 credits | 2,100 credits |
| 1080p | 567 credits | 945 credits | 1,890 credits | 2,835 credits |
Credits for failed generations are automatically refunded. For subscription and credit details, visit the AI FILMS Studio pricing page.
Prompt Tips for Happy Horse 1.1
Specify camera movement explicitly. "Slow dolly push toward the subject" produces a distinct result from "the camera moves forward". Happy Horse 1.1 follows cinematographic direction for dolly shots, pans, cranes, and rack focuses.
Use 720p and short durations for iteration. Generate at 720p for 3 to 5 seconds to validate motion and composition. The 33% lower 1080p cost in 1.1 makes it more practical to go to 1080p sooner for final deliverables.
Anchor the visual style. Add descriptors like "photorealistic", "film still", or "editorial" to set the visual register alongside motion instructions.
For image-to-video, describe the motion. The model already has your image. "Close up portrait of a woman" adds nothing. "She turns slightly to the left, eyes following something off frame" gives the model specific motion to execute.
For more detailed prompt guidance and use case examples, the Happy Horse 1.0 tutorial covers the same techniques in depth.
Sources
Alibaba
Continue Reading
Video & LipSync
- Video Generator
- Text to Video
- Image to Video
- Start-End Frame to Video
- Draw to Video
- Motion Control
- Video Enhancer
- Video Upscaler
- Video to Video LipSync
- Audio to Video LipSync
- Image to Video LipSync
- Video FaceSwap
- Seedance 2
- Vidu Q3 Pro
- Google Veo 3.1
- Kling 3.0 Pro
- LTX 2.3
- Happy Horse 1.0
- Kling 3.0 Motion
- ByteDance Upscaler
- InfiniteTalk
- InsightFace

