Text Behind Image
Place text behind the subject of any photo with automatic AI cut-out. Free, unlimited, and 100% private — your photos never leave your device.
100% Private
Images never leave your device
Auto cut-out
AI segments the subject for you
Live editor
Drag, color, scale — instant preview
Loading AI model (first time only)…
Drop an image to start
or click to browse
The first time you visit, the segmentation model downloads and is cached for future use.
Related Tools
About Text Behind Image
The text-behind-image effect is everywhere on Instagram, TikTok, and YouTube right now — a bold word like “MUSTANG”, “MAUI”, or “BEHIND” sits between the foreground subject and the background, so the subject appears to overlap and partially cover the type. Done well, it adds depth, drama, and a poster-like feel to a photo. Done in Photoshop, it takes a few minutes per image: select the subject, mask, add text underneath the mask, fine-tune. FormatFuse handles all of that in one click and lets you tweak the text live until it looks right.
Drop a JPG, PNG, or WebP into the editor and we automatically run a state-of-the-art segmentation model on your device to separate the subject from the background. While the cut-out runs, you can already type your text and pick a font, color, size, opacity, rotation, and letter spacing. The instant the cut-out finishes, the foreground snaps in front of your text and the canvas updates live. Drag the text around the image to position it; everything renders in real time at preview resolution and exports at full source resolution.
Unlike browser-based editors that ship your photo to a server, FormatFuse runs the entire pipeline locally inside your browser tab via WebAssembly and (where supported) WebGPU. There is no upload, no account, no watermark, and no usage cap — use it for product photos, social posts, lyric videos, gym posters, sports highlights, or anything else. The same on-device cut-out powers our background remover, so the quality is identical to a clean transparent PNG export, just composited into a stack of three layers behind your text.
Text Behind Image — Frequently Asked Questions
What is the text-behind-image effect?
It's a depth effect where bold text appears to sit behind the main subject of a photo. The subject (a person, car, animal, or product) overlaps and partially covers the text, which makes the photo feel layered and three-dimensional. FormatFuse builds it by stacking three layers on a canvas: the original image, your styled text, and a transparent PNG cut-out of the foreground subject on top.
Do I need to cut out the subject myself?
No — the cut-out is automatic. We run an AI segmentation model on your device the moment you drop your image. By the time you finish picking a font, the foreground is ready and snaps in front of your text. The same model powers our standalone Background Remover.
Will my photos be uploaded anywhere?
No. Everything runs inside your browser via WebAssembly and (where available) WebGPU. The only network activity is the first-time download of the segmentation model, which is then cached for future use. Your image never leaves your device.
Which fonts can I use?
We ship with a curated set of system-safe fonts (Impact, Arial Black, Inter, Georgia, Courier New) that work reliably across operating systems. You can adjust weight, size, color, opacity, rotation, and letter spacing for each. More font choices and custom font upload are on the roadmap.
Can I use it for commercial work?
Yes. There is no watermark, signup, or usage limit, and the segmentation model is permissively licensed. Use it for product listings, social posts, ads, posters, anything you like.
Why is the first use slower?
On first use the AI model is downloaded once (around 150 MB) and cached in your browser's IndexedDB. Every subsequent visit loads instantly from local cache, and processing typically takes 1–3 seconds per image on a modern laptop with WebGPU.
What input/output formats are supported?
Inputs: JPG, PNG, WebP, BMP, up to 100 MB. Output is always PNG at the source image's full resolution so you can drop it into Instagram, TikTok, Photoshop, Figma, Canva, Shopify, or anywhere else without quality loss.
What is the text-behind-image effect?
It's a depth effect where bold text appears to sit behind the main subject of a photo. The subject (a person, car, animal, or product) overlaps and partially covers the text, which makes the photo feel layered and three-dimensional. FormatFuse builds it by stacking three layers on a canvas: the original image, your styled text, and a transparent PNG cut-out of the foreground subject on top.
Do I need to cut out the subject myself?
No — the cut-out is automatic. We run an AI segmentation model on your device the moment you drop your image. By the time you finish picking a font, the foreground is ready and snaps in front of your text. The same model powers our standalone Background Remover.
Will my photos be uploaded anywhere?
No. Everything runs inside your browser via WebAssembly and (where available) WebGPU. The only network activity is the first-time download of the segmentation model, which is then cached for future use. Your image never leaves your device.
Which fonts can I use?
We ship with a curated set of system-safe fonts (Impact, Arial Black, Inter, Georgia, Courier New) that work reliably across operating systems. You can adjust weight, size, color, opacity, rotation, and letter spacing for each. More font choices and custom font upload are on the roadmap.
Can I use it for commercial work?
Yes. There is no watermark, signup, or usage limit, and the segmentation model is permissively licensed. Use it for product listings, social posts, ads, posters, anything you like.
Why is the first use slower?
On first use the AI model is downloaded once (around 150 MB) and cached in your browser's IndexedDB. Every subsequent visit loads instantly from local cache, and processing typically takes 1–3 seconds per image on a modern laptop with WebGPU.
What input/output formats are supported?
Inputs: JPG, PNG, WebP, BMP, up to 100 MB. Output is always PNG at the source image's full resolution so you can drop it into Instagram, TikTok, Photoshop, Figma, Canva, Shopify, or anywhere else without quality loss.

