AI API Tools — apiai.me

Add border

Programmatically wrap your images with clean, custom-colored outer borders or padding. Perfect for standardizing e-commerce thumbnails or social media frames.

View API → Image Editing and Cropping

Add image on image

Seamlessly overlay watermarks, brand logos, or secondary layers onto your base images. Features precision pixel positioning and customizable dropshadow effects.

View API → Video

Banner on video

Level up your video pipeline by embedding pre-designed lower thirds, promotional banners, or call-to-actions directly onto your video frames.

View API → Background Removal

an AI-powered, state-of-the-art, and enterprise-safe background removal solution developed by BRIA AI. It is designed to identify and isolate the primary subject (people, products, or objects) from its background with high accuracy, often used for e-commerce, advertising, and creative workflows.

View API → Image Filters

Brightness & Contrast

Fine-tune the exposure and tonal punch of your images. Easily correct dark, underexposed shots or amp up contrast to make your visuals pop.

View API → Visual Intelligence

Check Resolution

A fast gatekeeper for asset ingestion. Instantly verifies if an uploaded image matches your required width and height constraints, returning a quick boolean.

View API → Visual Intelligence

Check Transparency

Scans your media files to confirm whether an alpha channel (transparency) is present, helping you route files before layer composition.

View API → Image Editing and Cropping

Crop with padding

Extract specific coordinates or regions of an image while safely maintaining a customizable buffer of breathing room (padding) around your subject.

View API → Image Editing and Cropping

Detect and Crop

Leverage object detection to automatically locate the primary subject within an image and crop tightly around it—no manual coordinate configuration required.

View API → Video

Dremina Seedance 2.00

a second-generation, multimodal AI video generation model released in early 2026. It allows creators to generate cinematic-quality videos up to 15 seconds long (typically 5-12 seconds) using text, image, video, and audio inputs. It is highly regarded for its strong temporal consistency, meaning characters and objects maintain their appearance across frames, and its ability to handle complex camera movements like pans, tilts, and zooms

View API → Visual Intelligence

Extract Colors

Analyze your visuals to extract the dominant color palette and corresponding hex codes. Great for generating matching UI themes or auditing brand compliance.

View API → Customer Scripts

Fabric Swap / Material Swap

Fabric Swap — Replace the fabric, leather, or material on any product photo with a single API call. Upload your product image, a mask, and a swatch; we tile the swatch, transfer the product's lighting and shadows, and return a photorealistic mockup. Perfect for furniture, apparel, upholstery, and ecommerce product personalization. No AI, sub-second response, deterministic output.

View API → Image Editing and Cropping

Fancy text on images

Render dynamic, beautifully styled typography onto your visuals. Perfect for generating programmatic marketing banners, memes, or personalized user graphics.

View API → Image Editing and Cropping

Flip and Mirror

Instantly reverse your visuals horizontally or vertically to correct camera orientation or create unique symmetrical mirror effects.

View API → Image Generation

Flux 2 Max

AI image generation model, designed for maximum performance, highest editing consistency, and superior prompt adherence. It specializes in high-fidelity, production-grade visuals, making it ideal for advertising, top-tier e-commerce, and cinematic, high-resolution (up to 4MP) content

View API → Image Generation

Flux Fill Pro

Professional inpainting and outpainting model with state-of-the-art performance. Edit or extend images with natural, seamless results.

View API → Utilities

Format Converter

Optimize delivery or maximize legacy compatibility by seamlessly transcoding images between PNG, JPEG, modern WebP, or lossless BMP

View API → Video

Format Converter Video

Convert video formats to MP4

View API → Image Filters

Gaussian Blur

Apply a smooth, math-driven blur to mask sensitive user data, soften background clutter, or create elegant depth-of-field effects.

View API → Visual Intelligence

Gemini 2.5 Flash Lite

Our most cost-efficient multimodal model, offering the fastest performance for high-frequency, lightweight tasks. Gemini 2.5 Flash-Lite is best for high-volume classification, simple data extraction, and extremely low-latency applications where budget and speed are the primary constraints.

View API → Visual Intelligence

Gemini 3.1 Flash Lite Preview

Get early access to Google's next-generation, ultra-fast multimodal model. Optimized for sub-second visual analysis, high-frequency data extraction, and high-volume classification at a fraction of the cost.

View API → Utilities

Google Upscaler

Advanced image enhancement system that increases the resolution of low-quality, small, or compressed images by 2x or 4x, transforming them into high-definition visuals.

View API → Text Generation

Grok 3 mini

Lightweight, cost-efficient reasoning model from xAI, designed for high-speed performance in coding, math, and logic tasks. It features a 131K-token context window and provides transparent "chain-of-thought" reasoning, allowing users to trace its logic, with options for low or high reasoning effort

View API → Video

Grok Imagine Video

Transform descriptive text prompts into high-fidelity, fluid video clips leveraging xAI’s flagship generative video architecture. Perfect for cutting-edge social media content and motion design.

View API → Visual Intelligence

Grounding Dino Auto Detect

zero-shot, open-vocabulary object detection model that combines Transformer-based DINO detectors with grounded pre-training to detect objects using natural language prompts. It acts as an automated detection tool that does not require labeled datasets to identify objects, making it highly effective for auto-labeling images and rapid prototyping

View API → Visual Intelligence

Image Metadata Extractor

Peek under the hood of any media file. Instantly extract technical data including dimensions, file format, DPI, and deeply buried camera EXIF metadata.

View API → Video

Image on video

Overlay branded watermarks, channel logos, or graphical frames seamlessly across any video timeline for a polished, television-ready look.

View API → Utilities

Image Resizer

Scale images up or down to precise pixel dimensions while keeping the aspect ratio safely locked or forcing custom constraints.

View API → Image Filters

Invert Colors

Flip your image pixels to their exact photographic negative counterparts—ideal for unique artistic filters or technical visualization styles.

View API → Video

Kling V2.5 Turbo Pro

Unlock pro-level text-to-video and image-to-video creation with smooth motion, cinematic depth, and remarkable prompt adherence.

View API → Image Editing and Cropping

Mask Checker

Checks a mask against its source image. Returns an overlay for visual review plus a pass/fail verdict. Can ▎ pass the mask through instead of the overlay

View API → Image Generation

Multi Image Kontext Pro

An advanced, experimental composition engine that intelligently references, blends, and merges contextual elements or styles from two distinct input images into a single cohesive visual.

View API → Image Generation

Nano Banana

a high-velocity AI image generation and editing model from Google DeepMind. It specializes in fast, high-volume image creation, accurate text rendering within images, and maintaining consistency across edited visuals

View API → Image Generation

Nano Banana 2

Nano Banana 2, formerly known as Gemini 3.1 Flash Image, is an AI image generation and editing model. It was released by Google in February 2026. It combines the capabilities of previous "Pro" models with the performance of Google’s Flash architecture.

View API → Image Generation

Nano Banana 2 lite

Our fastest, most cost-efficient image model in the Nano Banana family yet, built for high throughput, speed and scale.

View API → Image Generation

Nano Banana Pro

a state-of-the-art image generation and editing model built on Gemini 3 Pro, designed for professional asset production, high-fidelity visual design, and complex, multi-turn instruction following. It enables advanced features like native 2K/4K rendering, precise text generation within images, and consistent character creation across multiple scenes

View API → Image Editing and Cropping

Nano Banana Pro Inpainting

Powerful inpainting model run by Nano Banana Pro. Uses image, mask and optional reference image.

View API → Text Generation

OpenAI GPT-4o

capable of processing and generating text, audio, and images in real-time. It offers GPT-4 level intelligence with vastly improved speed, 50% cheaper API costs, and better vision/audio understanding

View API → Visual Intelligence

OpenAI GPT-4o-mini

Fast and cost-efficient image understanding (e.g. captioning, classification, extracting text from screenshots, answering factual questions about an image)

View API → Image Generation

OpenAI GPT image 1.5

OpenAI's flagship image generation and editing model, built directly into the GPT-5 architecture to provide faster, more precise, and production-ready visual generation. It serves as a significant upgrade over previous models, focusing on "region-aware editing"—the ability to modify specific parts of an image without changing the entire scene

View API → Visual Intelligence

OpenAI o1

Reasoning-focused model that internally "thinks" before responding. Stronger on multi-step logic and complex analysis

View API → Image Editing and Cropping

Place Image on Canvas

Place a cut-out image (transparent background) onto a fixed-size canvas. The subject is scaled to fill up to a chosen % of the canvas (keeping proportions), centred by default — align it to any edge and nudge it with offsets. Use it to normalise cut-outs to a consistent size and position, e.g. before a generative/edit step or for a uniform catalog look.

View API → Utilities

Real Esrgan Upscaler

Real-ESRGAN is an open-source AI-powered image restoration and super-resolution model designed to upscale low-resolution images by – while removing noise, compression artifacts, and restoring fine details. It is particularly effective for real-world images, including photos, anime, and graphics, often employing U-Net discriminators to create sharp, high-fidelity outputs.

View API → Background Removal

Recraft Remove Background

Automated background removal for images. Tuned for AI-generated content, product photos, portraits, and design workflows

View API → Utilities

Recraft Vectorize

Convert raster images to high-quality SVG format with precision and clean vector paths, perfect for logos, icons, and scalable graphics.

View API → Background Removal

Remove Solid Background

Removes solid color background by flood fill from edges. Supports Grounding DINO for protecting logo elements.

View API → Image Editing and Cropping

Rotate image

Programmatically spin or re-orient any image to a precise angle or standard 90/180/270-degree positions.

View API → Utilities

Sam3 Image

Harness Meta's Segment Anything 3 (SAM) framework for zero-shot, pixel-perfect object segmentation. Isolate items, generate masks, or track elements across frames using natural language or coordinate prompts.

View API → Image Generation

Seedream 4

Seedream 4.0 is a next-generation, high-performance multimodal AI image model by ByteDance that unifies image generation and editing within a single, fast architecture. It supports text-to-image, image-to-image, and multi-image reference, delivering up to 4K resolution with high consistency and precision for creative and professional applications

View API → Image Generation

Seedream 4.5

Seedream 4.5: Upgraded Bytedance image model with stronger spatial understanding and world knowledge

View API → Background Removal

Segmentation Background Removal Birefnet

Isolate subjects with high precision using the state-of-the-art BiRefNet model, flawlessly detaching intricate silhouettes from complex backgrounds.

View API → Image Filters

Sepia Tint

Wash your photos in a nostalgic, warm-toned sepia or map any custom monochromatic color overlay to instantly shift the brand mood.

View API → Image Filters

Sharpen

Crisp up soft or slightly blurry visuals using an advanced unsharp mask, pulling hidden details back into sharp focus.

View API → Utilities

Smooth mask

Advanced mask smoothing with dual-mode algorithm. 'Simple' mode uses uniform SDF blur for blob-like masks (upholstery, panels). 'Adaptive' mode preserves thin features while smoothing thick regions (furniture frames, complex shapes). Includes hole filling, speckle removal, corner compensation, and anti-aliased output ready for compositing.

View API → Image Editing and Cropping

Solid Crop

Eliminate wasteful dead space. Intelligently detects and strips away redundant white space or solid-colored background edges to keep the focus tight on your main object.

View API → Image Filters

Stealth Mode

Strip away color distractions. Convert any vibrant image into a striking, high-contrast black-and-white masterpiece.

View API → Image Editing and Cropping

Transparent Crop

The ultimate asset-prep utility for transparent PNGs/WebPs. Automatically normalizes your output format, centers the subject, and injects a precise pixel margin to keep assets looking uniform.

View API → Image Filters

Turn To Grey

Turn any Image into Stealth Mode (black-white) / Turn your images into a stunning black and white photo

View API → Video

Veo 3.0 fast

Veo 3.0 Fast Generate is Google's high-speed AI video model designed for rapid iteration, prototyping, and cost-efficient production. It is a variant of the standard Veo 3 model, optimized to generate 8-second video clips significantly faster—roughly in half the time—while retaining high visual quality (92-99% of standard quality)

View API → Video

Veo 3.1 fast

a speed-optimized variant of Google's flagship generative AI video model, designed to produce high-quality, 1080p video about 2x faster than the Standard model while maintaining nearly identical quality. It is ideal for rapid prototyping, social content, and lowering costs (approx. 1/5th of standard) for creatives and developers

View API → Video

Video Captions

Burn highly legible, timestamped subtitles or timed text onto your videos to instantly boost social media engagement and ensure accessibility on silent feeds.

View API → Video

Video Filters

Apply an Instagram-style filter preset to a video (vintage, B&W, sepia, cinematic, etc.). Single ffmpeg pass, audio preserved.

View API →

AI API Toolsready to call

Add border

Add image on image

Banner on video

Bria Remove Background

Brightness & Contrast

Check Resolution

Check Transparency

Crop with padding

Detect and Crop

Dremina Seedance 2.00

Extract Colors

Fabric Swap / Material Swap

Fancy text on images

Flip and Mirror

Flux 2 Max

Flux Fill Pro

Format Converter

Format Converter Video

Gaussian Blur

Gemini 2.5 Flash Lite

Gemini 3.1 Flash Lite Preview

Google Upscaler

Grok 3 mini

Grok Imagine Video

Grounding Dino Auto Detect

Image Metadata Extractor

Image on video

Image Resizer

Invert Colors

Kling V2.5 Turbo Pro

Mask Checker

Multi Image Kontext Pro

Nano Banana

Nano Banana 2

Nano Banana 2 lite

Nano Banana Pro

Nano Banana Pro Inpainting

OpenAI GPT-4o

OpenAI GPT-4o-mini

OpenAI GPT image 1.5

OpenAI o1

Place Image on Canvas

Real Esrgan Upscaler

Recraft Remove Background

Recraft Vectorize

Remove Solid Background

Rotate image

Sam3 Image

Seedream 4

Seedream 4.5

Segmentation Background Removal Birefnet

Sepia Tint

Sharpen

Smooth mask

Solid Crop

Stealth Mode

Transparent Crop

Turn To Grey

Veo 3.0 fast

Veo 3.1 fast

Video Captions

Video Filters