Provider Optimization: Add Recraft V3, Ideogram V3, FAL FLUX Speed & Qwen Edit #17

@merlinrabens

Provider Optimization: Transform into Industry-Leading Image Generation MCP

Overview

Based on comprehensive market research and testing of all 9 current providers, this issue tracks the implementation of critical improvements to transform image-gen-mcp-server into an industry-leading 'monster' machine with unique capabilities no competitor offers.

Current Status

✅ All 9 providers operational (100% success rate)
✅ Average generation time: 8.7s
✅ Production-ready

Research Summary

See PROVIDER_OPTIMIZATION_REPORT.md for complete analysis.

Key Findings

  • Recraft V3 is now #1 globally (ELO 1172, 72% win rate)
  • Ideogram V3 launched March 2025 with enhanced capabilities
  • FAL FLUX offers a 3x speed improvement over the current BFL implementation (1.5s vs 4.5s)
  • Qwen Edit provides best-in-class advanced editing capabilities

Proposed Additions

Phase 1: Critical Improvements (Priority: HIGH)

1. Add Recraft V3 Provider ⭐ NEW

Impact: CRITICAL - Unlocks unique capabilities

  • #1 globally on quality benchmarks (beats all current providers)
  • Unique vector generation (SVG output - no competitor offers this)
  • Perfect text rendering (guaranteed flawless)
  • Pricing: $0.04/image (competitive)
  • Use Cases: Logo design, branding, graphic design, print-ready materials

Implementation:

  • Create src/providers/recraft.ts
  • Add RECRAFT_API_KEY env var support
  • Register in Config
  • Add to provider selector with keywords (vector, logo, branding, text layout)
  • Support both raster and vector outputs
  • Add tests for both formats
  • Update documentation
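
The keyword routing mentioned above could look roughly like the sketch below. It is a minimal illustration, not the server's actual selector: the `ProviderName` type, the `KEYWORD_ROUTES` table and the `selectProvider` function are hypothetical names, and the fallback to OpenAI is an assumption based on its "versatile fallback" role in the provider stack. The keyword lists are the ones proposed in this issue.

```typescript
// Hypothetical provider identifiers; the real names in the repo may differ.
type ProviderName = "recraft" | "fal-flux" | "openai";

// Keyword lists come from this issue; the table shape is an assumption.
const KEYWORD_ROUTES: Array<{ provider: ProviderName; keywords: string[] }> = [
  { provider: "recraft", keywords: ["vector", "logo", "branding", "text layout"] },
  { provider: "fal-flux", keywords: ["fast", "quick", "draft"] },
];

// Returns the first provider whose keyword appears as a whole word or phrase
// in the prompt, or the fallback when nothing matches.
function selectProvider(prompt: string, fallback: ProviderName = "openai"): ProviderName {
  const text = prompt.toLowerCase();
  for (const route of KEYWORD_ROUTES) {
    // Word boundaries keep "fast" from matching inside "breakfast".
    const hit = route.keywords.some((k) => new RegExp("\\b" + k + "\\b").test(text));
    if (hit) {
      return route.provider;
    }
  }
  return fallback;
}
```

Routes are checked in table order, so a prompt that mentions both "logo" and "quick" goes to Recraft in this sketch. Whether quality or speed should win in that case is a design choice left open here.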

API: https://www.recraft.ai/docs/api

2. Upgrade Ideogram V2 → V3 ⭐ UPGRADE

Impact: HIGH - Better text rendering and photorealism

  • Enhanced text rendering for complex/multi-line compositions
  • Improved photorealism and image-prompt alignment
  • Better API with more control options
  • Professional typography control

Implementation:

  • Update API endpoint from v2 to v3
  • Update model name references
  • Test for breaking changes
  • Update capabilities notes
  • Verify existing tests pass
  • Update documentation

3. Add FAL FLUX Speed Variant ⭐ NEW

Impact: MEDIUM-HIGH - 3x faster FLUX option

  • 1.5s generation time (vs current 4.5s with BFL)
  • Same quality as BFL FLUX
  • Pricing: ~$0.02/image (50% cheaper)
  • Perfect for rapid iterations

Implementation:

  • Option A: Create src/providers/fal-flux.ts (recommended)
  • Option B: Add model selection to existing FAL provider
  • Add to provider selector for "fast/quick/draft" keywords
  • Benchmark vs BFL FLUX
  • Add tests
  • Update documentation
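
For the "Benchmark vs BFL FLUX" step, one simple way to compare the two providers is the ratio of median latencies. The sketch below assumes latency samples have already been collected in milliseconds; `median` and `speedup` are hypothetical helper names, and the sample values are illustrative numbers built around the 4.5s and 1.5s figures quoted in this issue, not real measurements.

```typescript
// Median of a list of latency samples in milliseconds.
function median(values: number[]): number {
  if (values.length === 0) {
    throw new Error("no samples");
  }
  const sorted = [...values].sort((a, b) => a - b);
  const mid = Math.floor(sorted.length / 2);
  return sorted.length % 2 === 1 ? sorted[mid] : (sorted[mid - 1] + sorted[mid]) / 2;
}

// How many times faster the candidate is than the baseline, by median latency.
function speedup(baselineMs: number[], candidateMs: number[]): number {
  return median(baselineMs) / median(candidateMs);
}

// Illustrative samples only (assumed, not measured).
const bflSamples = [4400, 4500, 4600];
const falSamples = [1400, 1500, 1600];
console.log(speedup(bflSamples, falSamples)); // 3
```

The median is used rather than the mean so that a single slow request (cold start, network hiccup) does not dominate the comparison.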

API: https://fal.ai/models/fal-ai/flux-pro

Phase 2: Enhanced Editing (Priority: MEDIUM)

4. Add Qwen Image Edit Provider ⭐ NEW

Impact: HIGH - Professional-grade editing

  • Best-in-class inpainting/outpainting
  • Context-aware object manipulation
  • Professional editing suite
  • Intelligent fill for removed objects

Implementation:

  • Research Qwen Image Edit API availability
  • Create src/providers/qwen.ts
  • Add QWEN_API_KEY env var support
  • Implement inpainting, outpainting, object manipulation
  • Add mask/region support
  • Update edit tool to support new operations
  • Add tests
  • Update documentation
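
Since the Qwen Image Edit API still has to be researched, the request shapes below are purely hypothetical. The sketch only illustrates how the edit tool might model the new operations and enforce mask/region requirements before calling any provider. The `EditRequest` type, its field names and `validateEditRequest` are all assumptions.

```typescript
// Hypothetical request shapes; not taken from any real provider API.
type EditRequest =
  | { operation: "inpaint"; image: string; mask: string; prompt: string }
  | {
      operation: "outpaint";
      image: string;
      prompt: string;
      expand: { top: number; right: number; bottom: number; left: number };
    }
  | { operation: "remove-object"; image: string; mask: string };

// Returns a list of human-readable problems; an empty list means the request is usable.
function validateEditRequest(req: EditRequest): string[] {
  const errors: string[] = [];
  if (!req.image) errors.push("image is required");
  switch (req.operation) {
    case "inpaint":
      if (!req.mask) errors.push("inpaint requires a mask");
      if (!req.prompt.trim()) errors.push("inpaint requires a prompt");
      break;
    case "outpaint": {
      const { top, right, bottom, left } = req.expand;
      if ([top, right, bottom, left].some((n) => n < 0)) {
        errors.push("expand values must be non-negative");
      }
      if (top + right + bottom + left === 0) {
        errors.push("outpaint must expand at least one side");
      }
      break;
    }
    case "remove-object":
      if (!req.mask) errors.push("remove-object requires a mask");
      break;
  }
  return errors;
}
```

Modeling the operations as a discriminated union lets the compiler check that each operation carries the fields it needs, so only value-level checks (empty mask, zero expansion) remain for runtime.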

5. Verify/Upgrade Gemini to 2.5 Flash Image ⭐ VERIFY

Impact: MEDIUM - Enhanced editing capabilities

  • Confirm using latest Gemini 2.5 Flash Image
  • Test integrated editing features (inpainting, outpainting)
  • Verify model version in responses

Implementation:

  • Check current Gemini implementation
  • Verify model version used
  • Test editing capabilities
  • Update if necessary
  • Update documentation

Phase 3: Monitoring & Optimization (Priority: LOW)

6. Provider Health Tracking

  • Success/failure rate monitoring
  • Performance metrics collection
  • Cost tracking per provider
  • Automatic optimization
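
A minimal in-memory sketch of the tracking described above, assuming each provider call reports whether it succeeded, how long it took and what it cost. `ProviderHealthTracker` and its field names are hypothetical; persistence and time windows are left out.

```typescript
interface ProviderStats {
  successes: number;
  failures: number;
  totalLatencyMs: number;
  totalCostUsd: number;
}

// Hypothetical tracker; accumulates per-provider counters in memory.
class ProviderHealthTracker {
  private stats = new Map<string, ProviderStats>();

  // Record the outcome of one provider call.
  record(provider: string, result: { ok: boolean; latencyMs: number; costUsd: number }): void {
    const s = this.stats.get(provider) ?? {
      successes: 0,
      failures: 0,
      totalLatencyMs: 0,
      totalCostUsd: 0,
    };
    if (result.ok) {
      s.successes += 1;
    } else {
      s.failures += 1;
    }
    s.totalLatencyMs += result.latencyMs;
    s.totalCostUsd += result.costUsd;
    this.stats.set(provider, s);
  }

  // Aggregate view for one provider, or undefined if it has no recorded calls.
  summary(provider: string) {
    const s = this.stats.get(provider);
    if (!s) return undefined;
    const calls = s.successes + s.failures;
    return {
      calls,
      successRate: s.successes / calls,
      avgLatencyMs: s.totalLatencyMs / calls,
      totalCostUsd: s.totalCostUsd,
    };
  }
}
```

Usage would be a single `tracker.record("recraft", { ok: true, latencyMs: 1000, costUsd: 0.04 })` after each call, with `tracker.summary("recraft")` feeding dashboards or routing decisions.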

7. Enhanced Provider Selection

  • Use historical performance data
  • A/B testing framework
  • Fine-tune keyword matching
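
One possible way to use historical performance data is to filter candidates by a minimum success rate and then prefer the fastest one. The sketch below is an assumption about how that could work, not a description of the existing selector. `ProviderHistory`, `pickByHistory` and the 0.9 default threshold are hypothetical.

```typescript
interface ProviderHistory {
  name: string;
  successRate: number; // 0..1, from past calls
  avgLatencyMs: number;
}

// Picks the fastest provider among those meeting the reliability threshold.
// If none meets it, falls back to the most reliable candidate.
function pickByHistory(candidates: ProviderHistory[], minSuccessRate = 0.9): string | undefined {
  if (candidates.length === 0) return undefined;
  const healthy = candidates.filter((c) => c.successRate >= minSuccessRate);
  if (healthy.length === 0) {
    return [...candidates].sort((a, b) => b.successRate - a.successRate)[0].name;
  }
  return healthy.reduce((best, c) => (c.avgLatencyMs < best.avgLatencyMs ? c : best)).name;
}
```

In practice this would run only over providers that already passed keyword matching, so history refines the choice rather than replacing capability-based routing.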

Expected Outcomes

Unique Capabilities Unlocked

✅ Vector generation (SVG) - NO COMPETITOR OFFERS THIS
✅ Perfect text rendering from 2 providers (Recraft + Ideogram V3)
✅ Ultra-fast FLUX option (1.5s vs 4.5s)
✅ Advanced editing suite (inpainting, outpainting, object manipulation)
✅ #1 and #2 globally ranked models (Recraft + FLUX)
✅ Complete use-case coverage with zero gaps

Performance Improvements

  • Faster average generation time with FLUX Speed
  • Lower costs with smart routing (~40% savings potential)
  • Better quality with Recraft V3 and Ideogram V3

Competitive Position

Current: Excellent, production-ready
After: Industry-leading with unique capabilities no other MCP server offers

Provider Stack After Implementation (12 Total)

Core Generation:

  1. RECRAFT V3 ⭐ NEW - Vector art, perfect text, graphic design
  2. BFL FLUX 1.1 Pro - Ultra-quality photorealism
  3. FAL FLUX Speed ⭐ NEW - Ultra-fast FLUX (1.5s)
  4. IDEOGRAM V3 ⭐ UPGRADE - Text rendering specialist
  5. LEONARDO - Artistic, cinematic, fantasy
  6. OPENAI DALL-E 3 - Versatile fallback
  7. GEMINI 2.5 Flash - Fast, multimodal

Specialized:

  8. QWEN Edit ⭐ NEW - Advanced editing suite
  9. CLIPDROP - Background removal, post-processing
  10. STABILITY SD Core - Photorealism fallback

Experimental:

  11. FAL SDXL - Ultra-fast drafts
  12. REPLICATE - Open models access

Timeline

  • Phase 1: 1-2 weeks (Critical improvements)
  • Phase 2: 1-2 weeks (Enhanced editing)
  • Phase 3: 2 weeks (Monitoring/optimization)
  • Total: 4-6 weeks

Breaking Changes

None - all changes are additive and backward compatible

Manual Setup Required

See implementation branch for list of required API keys and account registrations.

Success Metrics

  • All 12 providers operational (100% success rate)
  • Vector generation capability tested and documented
  • Average generation time reduced with FLUX Speed option
  • Advanced editing operations (inpainting/outpainting) functional
  • Documentation updated with all new providers
  • All tests passing

Labels: enhancement (New feature or request), performance (Performance improvements), priority: high (High priority issue), provider (Provider-related improvements)
