FLORA Docs
FLORA Docs
  • Welcome to the Garden
  • Getting Started
    • Quickstart
    • FLORA Concepts
      • Manifesto
      • Our Product Philosophy
  • BLOCKS
    • Block Overview
    • Text Block
      • Text to Text
      • Image to Text
      • Video to Text
    • Image Block
      • Text to Image
      • Image to Image
    • Video Block
      • Text to Video
      • Image(s) to Video
  • Editor
    • Editor Overview
    • Canvas
    • Toolbar
    • Navigation
    • Collaboration & Share
  • HOW TO...
    • Styles
    • Unsplash
  • COMMUNITY
    • Community
Powered by GitBook
On this page
  • Summary
  • Images to Video Models
  • Parameters
  • Images to Video​
  • How to use
  1. BLOCKS
  2. Video Block

Image(s) to Video

PreviousText to VideoNextEditor Overview

Last updated 29 days ago

Summary

Image-to-Video models generate videos by taking a single image or multiple images as input, using the input image(s) to guide scene continuity, motion, and transitions, resulting in dynamic video content that expands the static visuals into a moving narrative while maintaining visual consistency.

Images to Video Models

Models
Description
Best For
Supported Parameters

Hailuo Minimax

Advanced multimodal model with strong contextual understanding. Good at interpreting complex prompts and maintaining narrative consistency.

Long-form videos with detailed storylines and character continuity.

/

Google's Veo2's text-to-video model offers high cinematic quality and dynamic motion. It excels in rendering smooth transitions and diverse motion dynamics but may struggle with scene coherence in complex scenarios.

Creating cinematic videos with realistic motion and high-quality output.

Aspect Ratio: Landscape(16:9), Portrait(9:16).

Duration: 5s, 6s, 7s, 8s.

An Alibaba's open-source video generation model, available in 14 billion and 1.3 billion parameter versions. It supports text-to-video, image-to-video, and video editing tasks.

Creating videos with specified motion paths from images.

Offers cinematic-grade video generation with complex camera movements like zooms and pans. Supports mixed image-text-video inputs and over 60 artistic styles.

Creating cinematic videos with complex camera movements and mixed inputs.

Aspect Ratio: Landscape(16:9), Portrait(9:16), Square (1:1).

Kling Standard 1.6

Balanced model focused on quality-to-speed ratio. Offers good visual fidelity with reasonable generation times.

Quick prototyping and general-purpose video creation with moderate detail requirements.

Duration: 5s, 10s.

Kling Pro 1.6

Specializes in photorealistic rendering with advanced lighting and physics simulation.

Product demos, architectural visualizations, and realistic human movements.

Duration: 5s, 10s.

Kling Pro 1.5

A cinematic camera with smooth camera motion and high visual fidelity. Optimized for cinematic output, supporting nuanced prompts and detailed scene rendering.

Creating concept visualizations, mood films, and animated storytelling from stills.

Duration: 5s, 10s.

A large-scale video generative model capable of creating realistic visuals with natural, coherent motion.

Generating realistic videos with coherent motion from text and images.

Aspect Ratio: Landscape(16:9, 4:3), Wide Landscape(21:9), Portrait(9:21, 9:16, 3:4).

Resolution: 540p, 720p, 1080p.

Duration: 5s, 9s.

Loop: Yes/No.

Luma Ray 2 Flash

A variant of Luma Ray 2 model optimized for faster and more cost-effective generation.

Quick generation of short, realistic videos.

Aspect Ratio: Landscape(16:9, 4:3), Wide Landscape(21:9), Portrait(9:21, 9:16, 3:4).

Resolution: 540p, 720p, 1080p.

Duration: 5s, 9s.

Loop: Yes/No.

Great for character animation and emotional expression. Good at maintaining consistent subjects across scenes.

Character-driven narratives, explainer videos with avatars, and emotional storytelling.

Aspect Ratio: Landscape(16:9, 3:2, 5:4), Portrait(9:16, 2:3, 4:5), Square(1:1).

Resolution: 720p, 1080p.

Duration: 5s, 10s.

Loop: Yes/No.

Runway Gen-4 Turbo

Runway’s fastest text-to-video and image-to-video model, capable of generating high-fidelity 2-second clips in near real-time. It builds on Gen-4’s photorealism and motion consistency with drastically reduced latency, enabling interactive creative workflows.

Largest open-source text-to-video model with 13 billion parameters. innovative video-to-audio synthesis for realistic sound generation.

Global marketing campaigns, localized content, and videos requiring cultural nuance.

Styles: None, 3D Character, Anime, Close-up;

Resolution: 480p, 720p, 1080p; Pro Mode(Higher quality video generation): On/Off; Duration: 5s, 2.5s.

Lightricks LTXV

Good for maintaining smooth transitions between frames, reducing flickering and scene inconsistencies.

Generating dynamic video content quickly for storyboards and animatics with fluid scene transitions.

Quality (Increase the quality of the output): 0-100.

Parameters

These are parameters that are applicable to all our base models.

Parameter
Type
Effect on Output

Prompt

Text

The text prompt is processed through the selected model.

Seed

Seed

The seed is a deterministic number that indexes generations from the model. It's typically randomized, but you can set a seed if there's a particular output you're looking for! Keep in mind that all parameters must be the same in order for a given seed's output to persist.

The Images to Video node generates a video by connecting multiple frames. You can input up to 9 frames and the model will fill in the gaps! It primarily uses IP-Adapter based morphing techniques that's happening in the backend.

You can interact with it in a number of ways:

  • Input images by clicking the upload button in the node itself

  • Connect any image output to the node and it'll populate the images section

  • Reorder or delete images once populated

  • Use the prompt to help guide the output!

How to use

Here are some example workflows using Text to Video in our community page:

Images to Video

​
Timelapse Video Creation
Floral Framed to Video Introduction
Veo 2
WAN 2.1
Kling 2.0
Luma Ray 2
Pika
Tencent Hunyuan