Abstract: Vision-language models (VLMs) offer flexible object detection through natural language prompts but suffer from performance variability depending on prompt phrasing. In this paper, we ...
Pocket TTS delivers high-quality text-to-speech on standard CPUs. No GPU, no cloud APIs. It is the first local TTS with voice ...
prompt-maker is an interactive terminal application built with Go and Bubble Tea. It takes your rough idea for a prompt, sends it to a Gemini model with a specialized "prompt optimization" system ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results