Skip to content
@VIPL-Audio-Visual-Speech-Understanding

VIPL AVSU

Audio-Visual Speech Understanding Research Group at Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences, ICT, CAS

Pinned Loading

  1. VIPL-AVSU-Group VIPL-AVSU-Group Public

    Collection of works from VIPL-AVSU

    45 5

  2. CAS-VSR-S101 CAS-VSR-S101 Public

    CAS-VSR-S101: A large-scale Mandarin dataset from TV broadcasts for audio-visual speech research

    7 1

  3. learn-an-effective-lip-reading-model-without-pains learn-an-effective-lip-reading-model-without-pains Public

    The PyTorch Code and Model In "Learn an Effective Lip Reading Model without Pains", (https://arxiv.org/abs/2011.07557), which reaches the state-of-art performance in LRW-1000 dataset.

    Python 164 38

  4. CAS-VSR-MOV20 CAS-VSR-MOV20 Public

    CAS-VSR-MOV20: A challenging dataset for Chinese visual speech recognition, consisting of video clips from 20 movies.

    1

  5. Lipreading-DenseNet3D Lipreading-DenseNet3D Public

    DenseNet3D Model In "LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild", https://arxiv.org/abs/1810.06990

    Python 118 21

  6. CAS-VSR-S68 CAS-VSR-S68 Public

    CAS-VSR-S68: A dataset for lip reading with unseen speakers, spanning 68 hours of news broadcasts.

    7

Repositories

Showing 10 of 11 repositories

Top languages

Loading…

Most used topics

Loading…