Search

Your search keyword '"Pantic, Maja"' showing total 1,040 results

Search Constraints

Start Over You searched for: Author "Pantic, Maja" Remove constraint Author: "Pantic, Maja"
1,040 results on '"Pantic, Maja"'

Search Results

1. RT-LA-VocE: Real-Time Low-SNR Audio-Visual Speech Enhancement

2. Dynamic Data Pruning for Automatic Speech Recognition

3. MSRS: Training Multimodal Speech Recognition Models from Scratch with Sparse Mask Optimization

4. EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars

5. BRAVEn: Improving Self-Supervised Pre-training for Visual and Auditory Speech Recognition

6. Audio-visual video-to-speech synthesis with synthesized input audio

7. SparseVSR: Lightweight and Noise Robust Visual Speech Recognition

8. Large-scale unsupervised audio pre-training for video-to-speech synthesis

9. Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models

10. SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision

11. Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels

12. Learning Cross-lingual Visual Speech Representations

13. Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation

14. Jointly Learning Visual and Auditory Speech Representations from Raw Data

15. LA-VocE: Low-SNR Audio-visual Speech Enhancement using Neural Vocoders

16. FAN-Trans: Online Knowledge Distillation for Facial Action Unit Detection

17. Streaming Audio-Visual Speech Recognition with Alignment Regularization

18. SS-VAERR: Self-Supervised Apparent Emotional Reaction Recognition from Video

19. Training Strategies for Improved Lip-reading

20. SVTS: Scalable Video-to-Speech Synthesis

21. Self-supervised Video-centralised Transformer for Video Face Clustering

22. Visual Speech Recognition for Multiple Languages in the Wild

23. Leveraging Real Talking Faces via Self-Supervision for Robust Forgery Detection

24. Defensive Tensorization

25. Domain Generalisation for Apparent Emotional Facial Expression Recognition across Age-Groups

26. EasyCom: An Augmented Reality Dataset to Support Algorithms for Easy Communication in Noisy Environments

27. FP-Age: Leveraging Face Parsing Attention for Facial Age Estimation in the Wild

28. LiRA: Learning Visual Speech Representations from Audio through Self-supervision

29. End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks

30. DINO: A Conditional Energy-Based GAN for Domain Translation

31. End-to-end Audio-visual Speech Recognition with Conformers

32. RoI Tanh-polar Transformer Network for Face Parsing in the Wild

33. Cauchy-Schwarz Regularized Autoencoder

34. Lips Don't Lie: A Generalisable and Robust Approach to Face Forgery Detection

36. Lip-reading with Densely Connected Temporal Convolutional Networks

37. Multilinear Latent Conditioning for Generating Unseen Attribute Combinations

38. Towards Practical Lipreading with Distilled and Efficient Models

39. Learning Speech Representations from Raw Audio by Joint Audiovisual Self-Supervision

40. Enhancing Facial Data Diversity with Style-based Face Aging

41. Dilated Convolutions with Lateral Inhibitions for Semantic Image Segmentation

42. Investigating Bias in Deep Face Analysis: The KANFace Dataset and Empirical Study

43. Does Visual Self-Supervision Improve Learning of Speech Representations for Emotion Recognition?

44. Toward fast and accurate human pose estimation via soft-gated skip connections

45. Lipreading using Temporal Convolutional Networks

46. Visually Guided Self Supervised Learning of Speech Representations

47. Detecting Adversarial Attacks On Audiovisual Speech Recognition

48. Speech-driven facial animation using polynomial fusion of features

49. Towards Pose-invariant Lip-Reading

50. Shape Constrained Network for Eye Segmentation in the Wild

Catalog

Books, media, physical & digital resources