Spaces:
Build error
Build error
A newer version of the Gradio SDK is available:
6.8.0
metadata
title: Qwen2-VL Amazon Listing Generator
emoji: π
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
license: apache-2.0
π Qwen2-VL Amazon Listing Generator (LoRA)
This Hugging Face Space showcases a fine-tuned Qwen2-VL-7B model with LoRA adapter trained to generate Amazon-style product listings from product images.
π Features
- Vision-Language Model: Qwen2-VL-7B-Instruct with custom LoRA adapter
- Amazon Listing Generation: Creates structured product listings with:
- Product title
- Bullet points (key features)
- Product description
- Keywords
- Product category
- CPU Optimized: Runs on free CPU hardware (may take 1-2 minutes per generation)
π§ Model Details
- Base Model: Qwen/Qwen2-VL-7B-Instruct
- LoRA Adapter: soupstick/qwen2vl-amazon-ft-lora
- Fine-tuning: Specialized for e-commerce product listing generation
π― How to Use
- Upload Image: Click on the image upload area and select a product photo
- Optional Prompt: Modify the instruction if needed (default works well)
- Generate: Click "Generate Listing" and wait for results
- Review Output: Get structured Amazon-style listing in JSON format
π Expected Output Format
{
"title": "Product Title Here",
"bullet_points": [
"β’ Key feature 1",
"β’ Key feature 2",
"β’ Key feature 3"
],
"description": "Detailed product description...",
"keywords": "relevant, product, keywords",
"category": "Product > Category > Subcategory"
}
β‘ Performance Notes
- CPU Mode: This demo runs on CPU hardware for free access
- Processing Time: 1-2 minutes per generation due to CPU limitations
- Image Size: Automatically resized to 512px for optimal performance
- Memory Optimized: Uses float32 and low memory settings
π Links
β οΈ Limitations
- Demo Purpose: This is a prototype for concept demonstration
- Accuracy: Results depend on training data quality and model size
- Speed: CPU inference is slower than GPU (upgrade hardware for faster results)
- Languages: Primarily trained on English product descriptions
π οΈ Technical Stack
- Framework: Transformers, PEFT (LoRA), Gradio
- Model: Qwen2-VL-7B with custom LoRA adapter on Unsloth-AI
- Hardware: CPU-optimized for Hugging Face Spaces free tier
Built with β€οΈ using Hugging Face Spaces