×

SafeWatch-Bench

Measures
Minor safety
Score
0–100
Ships in
Lemu Kids
Modality
Audio + visual

The concept

Multimodal safety scoring on children's video — transcript, frames, and family values aligned.

Inputs include speech transcripts, key frames, and metadata. Outputs are calibrated 0–100 scores with explainable sub-dimensions parents can configure.

How we did it

We measure cross-modal agreement, false negatives on subtle risks, and alignment drift when family profiles change. Every content score in Lemu Kids is regression-tested here before release.

Previous eval
←Verify-Bench
Next eval
Compliance-Bench→