Open House Now — Property Listing

Whisper Hallucination Detection and Mitigation via Hidden Representation Steering and Sparse AutoEncoders

2026-06-05

Summary

A robotics research paper on Whisper Hallucination Detection and Mitigation via Hidden Representation Steering and Sparse AutoEncoders.

Property description

Whisper, a widely adopted ASR model, is known to suffer from hallucinations - coherent transcriptions generated for non-speech audio entirely disconnected from the input. We investigate whether hallucinations can be detected and mitigated through Whisper's internal representations. We extract audio encoder activations and evaluate two representation spaces: raw Whisper activations and Sparse AutoEncoder (SAE) latents. We show that both spaces encode linearly separable hallucination-related information, with discriminative power concentrated in a sparse feature subset and increasing toward deeper encoder layers. We propose two steering strategies: activation-space steering and SAE latent-space steering. SAE-based steering reduces hallucination rate from 72.63% to 14.11% for Whisper small and from 86.88% to 27.33% for Whisper large-v3 on the full non-speech test set, with small WER degradation on speech data, approaching the performance of fine-tuning-based methods.

Interested in this property?

Open House Now can help you schedule a visit, connect with the listing agent, and find similar homes for sale in this neighborhood.

Comments

No comments yet. Be the first to share your thoughts on this listing.

Whisper Hallucination Detection and Mitigation via Hidden Representation Steering and Sparse AutoEncoders

Summary

Property description

Links

Interested in this property?

Comments