Skip to content

[FEATURE] daemon mode #25

@farnoy

Description

@farnoy

Feature Description

Persistent daemon mode that keeps the model in memory and provides lower end-to-end latency.

Problem Statement

When the model file is in the page cache, it's acceptably fast. But the latency could be far lower if the model was kept resident in GPU memory.

Proposed Solution

Keeping existing scripts and commands would be best, so ideally we'd have an optional daemon that the main command utilizes and forwards the audio to when it's running in the background.

Alternative Solutions

Use Case

  • people with enough VRAM that transcribe often

Implementation Ideas

Additional Context

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions