Spaces:

lxzcpro
/

TextEraser

Running

App Files Files Community

TextEraser / README.md

lxzcpro

Update README.md

2c71a88 verified 10 days ago

preview code

raw

history blame contribute delete

2.5 kB

A newer version of the Gradio SDK is available: 6.1.0

Upgrade

metadata

title: TextEraser
emoji: 🧼
colorFrom: blue
colorTo: indigo
sdk: gradio
app_file: app.py
pinned: false
license: mit
short_description: Interactive Text-Guided Object Removal
sdk_version: 6.0.2

TextEraser

Text-Guided Object Removal using SAM2 + CLIP + Stable Diffusion XL

Final Project for COMPSCI372: Intro to Applied Machine Learning @ Duke University (Fall 2025)

Overview

TextEraser intelligently removes objects from images using natural language descriptions. Simply type what you want to remove (e.g., "bottle", "person", "car"), and the AI pipeline handles the rest.

Key Features

Natural language control - Remove objects by describing them in plain text
Smart segmentation - Uses SAM2 to find all objects in the image
Intelligent matching - CLIP identifies which segments match your description
Seamless inpainting - Stable Diffusion XL fills in the removed area naturally
Multi-part object handling - Automatically merges related segments (e.g., cat + tail)
Interactive web interface - Real-time Gradio UI with debug visualization

How It Works

The pipeline has three stages:

Segmentation (SAM2) - Generates candidate object masks across the image
Matching (CLIP) - Scores each segment against your text query
Inpainting (SDXL) - Fills the masked region with contextually appropriate content

Installation

Requirements

Python 3.8+
CUDA GPU with 12GB+ VRAM (recommended)
~10GB disk space for models

Setup

# Clone repository
git clone https://github.com/lxzcpro/TextEraser.git
cd TextEraser

# Install dependencies
pip install -r requirements.txt

# Run the app
python app.py

On first run, models will auto-download from HuggingFace (~10GB total).

Usage

Web Interface

Launch the app: python app.py
Upload an image
Enter what to remove (e.g., "bottle", "car", "person")
Optionally specify background fill (default: "background")
Click "Run Pipeline"
Check the debug tab to see what was detected

Python API

from src.pipeline import ObjectRemovalPipeline
from PIL import Image
import numpy as np

# Initialize pipeline
pipeline = ObjectRemovalPipeline()

# Load and process image
image = np.array(Image.open("photo.jpg"))
result, mask, message = pipeline.process(
    image=image,
    text_query="bottle",
    inpaint_prompt="table surface"
)

# Save result
Image.fromarray(result).save("result.jpg")