Abstract: Effective prompt tuning is critical for using generative AI models, such as large language models (LLMs) and small language models (SLMs), for domain-specific tasks. However, optimizing ...
Abstract: Vision-language models (VLMs) offer flexible object detection through natural language prompts but suffer from performance variability depending on prompt phrasing. In this paper, we ...
Pocket TTS delivers high-quality text-to-speech on standard CPUs. No GPU, no cloud APIs. It is the first local TTS with voice ...