GitHub - liutaocode/DiarizationVisualization: Visualization tools for audio-only and multi-modal speaker diarization dataset
![Applied Sciences | Free Full-Text | Active Correction for Incremental Speaker Diarization of a Collection with Human in the Loop Applied Sciences | Free Full-Text | Active Correction for Incremental Speaker Diarization of a Collection with Human in the Loop](https://pub.mdpi-res.com/applsci/applsci-12-01782/article_deploy/html/images/applsci-12-01782-g001-550.jpg?1644829543)
Applied Sciences | Free Full-Text | Active Correction for Incremental Speaker Diarization of a Collection with Human in the Loop
![Multimodal speaker diarization pipeline with pre-trained audio-visual... | Download Scientific Diagram Multimodal speaker diarization pipeline with pre-trained audio-visual... | Download Scientific Diagram](https://www.researchgate.net/publication/337511134/figure/fig1/AS:829304678141953@1574733051941/Multimodal-speaker-diarization-pipeline-with-pre-trained-audio-visual-synchronization.png)
Multimodal speaker diarization pipeline with pre-trained audio-visual... | Download Scientific Diagram
![The concepts of beam search decoding in the context of ASR and Speaker... | Download Scientific Diagram The concepts of beam search decoding in the context of ASR and Speaker... | Download Scientific Diagram](https://www.researchgate.net/publication/373838656/figure/fig1/AS:11431281188009075@1694489072188/The-concepts-of-beam-search-decoding-in-the-context-of-ASR-and-Speaker-Diarization-SD_Q320.jpg)
The concepts of beam search decoding in the context of ASR and Speaker... | Download Scientific Diagram
![Example scenes in audio-visual diarization datasets. Existing datasets... | Download Scientific Diagram Example scenes in audio-visual diarization datasets. Existing datasets... | Download Scientific Diagram](https://www.researchgate.net/publication/356633075/figure/fig1/AS:1095750100500480@1638258590102/Example-scenes-in-audio-visual-diarization-datasets-Existing-datasets-AMI-7-and.png)
Example scenes in audio-visual diarization datasets. Existing datasets... | Download Scientific Diagram
![AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario: Paper and Code - CatalyzeX AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario: Paper and Code - CatalyzeX](https://www.catalyzex.com/_next/image?url=https%3A%2F%2Fai2-s2-public.s3.amazonaws.com%2Ffigures%2F2017-08-08%2F22d6e607fde6c79a7320bf2f4a37ff04ff7658b3%2F3-Table1-1.png&w=640&q=75)
AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario: Paper and Code - CatalyzeX
GitHub - X-LANCE/MSDWILD: [INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.
![Automatic speaker diarization for natural conversation analysis in autism clinical trials | Scientific Reports Automatic speaker diarization for natural conversation analysis in autism clinical trials | Scientific Reports](https://media.springernature.com/m685/springer-static/image/art%3A10.1038%2Fs41598-023-36701-4/MediaObjects/41598_2023_36701_Fig1_HTML.png)