whisper.cpp/src
Yshtola f53dc74843
whisper : Fix UTF-8 character boundary issue in segment wrapping (max_len) (#3592)
The current implementation in `whisper_wrap_segment()` uses `strlen()` to count bytes, not UTF-8 characters. When splitting segments at `max_len`, this can break multi-byte UTF-8 characters, resulting in invalid sequences displayed as `�` (U+FFFD replacement character).
2026-01-16 14:16:05 +02:00
..
coreml coreml : backport CoreML features to macos < 14 (#3255) 2025-06-24 09:24:27 +02:00
openvino whisper : reorganize source code + improve CMake (#2256) 2024-06-26 19:34:09 +03:00
CMakeLists.txt examples : fix executable example targets (#3600) 2026-01-13 08:08:18 +01:00
whisper-arch.h vad : add initial Voice Activity Detection (VAD) support (#3065) 2025-05-12 16:10:11 +02:00
whisper.cpp whisper : Fix UTF-8 character boundary issue in segment wrapping (max_len) (#3592) 2026-01-16 14:16:05 +02:00