logo

SPSC Symposium

4th Symposium on Security and Privacy in Speech Communication

and

3rd VoicePrivacy Challenge

September 6th, as an INTERSPEECH SATELLITE EVENT

Submission is now open

Submit your paper here

iscawhite

Speech and voice are media through which we express ourselves. Speech communication can be used to command virtual assistants, to transport emotion or to identify oneself. How can we strengthen security and privacy for speech representation types in user-centric human/machine interaction?

Interdisciplinary exchange is in high demand. The need to better understand and develop user-centric security solutions and privacy safeguard in speech communication is of growing importance for commercial, forensic, and government applications. The SPSC Symposium is a platform to seek better designed services and products as well as better informed policy papers for legislators and governance. The symposium is organized by the ISCA SPSC special interest group and the VoicePrivacy Challenge Team.

Last year's Proceedings

The proceedings are available via the ISCA Archive.

The fourth edition of the Symposium on Security & Privacy in Speech Communication, focuses on speech and voice through which we express ourselves. As speech communication can be used to command virtual assistants to transport emotion or to identify oneself, the symposium encourages participants to give answers on how we can strengthen security and privacy for speech representation types in user-centric human/machine interaction? The symposium therefore sees that interdisciplinary exchange is in high demand and aims to bring together researchers and practitioners across multiple disciplines – more specifically: signal processing, cryptography, security, human-computer interaction, law, ethics, and anthropology.

The VoicePrivacy initiative is spearheading the effort to develop privacy preservation solutions for speech technology. It aims to consolidate the newly formed community to develop the task and metrics and to benchmark progress in anonymization solutions using common datasets, protocols, and metrics. VoicePrivacy takes the form of a competitive challenge. In keeping with the previous VoicePrivacy Challenge editions, the current edition focuses on voice anonymization. Participants are required to develop anonymization systems to suppress speaker identity while keeping the content and paralinguistic attributes intact. This edition focuses on preserving the emotional state, which is the key paralinguistic attribute in many real-world applications of voice anonymization. All the participants are encouraged to submit to the SPSC Symposium papers related to their challenge entry, as well as other scientific papers related to speaker anonymization and voice privacy. More details can be found on the VoicePrivacy Challenge webpage: https://www.voiceprivacychallenge.org/.

For the general symposium, we welcome contributions to related topics, as well as progress reports, project dissemination, or theoretical discussions and “work in progress”. In addition, guests from academia, industry and public institutions as well as interested students are welcome to attend the conference without having to make their own contribution.

Although, we aim for meeting all of you on-site, we also opt for virtual presentations during the symposium.

For on-side participants that also join the Interspeech 2024 please register via Interspeech, for remote or on-side participants that will only visit the SPSC Symposium please write a mail to ingo.siegert@ovgu.de

Program

We are glad to announce the program, all times are Eastern European Summer Time, EEST:

Friday, September 6th 2024

  • 09:30 - 09:45 Arrival and coffee
    09:45 - 10:00 Welcome from the organisers
    10:00 - 11:15 5 SPSC talks, 4 in-person and 1 online
    • Examining the Interplay Between Privacy and Fairness for Speech Processing: A Review and Perspective
      Anna Leschanowsky, Sneha Das
    • Preserving spoken content in voice anonymisation with character-level vocoder conditioning
      Michele Panariello, Massimiliano Todisco, Nicholas Evans
    • CommonBench: A larger Scale Speaker Verification Benchmark
      Jan Hintz, Ingo Siegert
    • Scenario of Use Scheme: Threat Modelling for Speaker Privacy Protection in the Medical Domain
      Mehtab Ur Rahman, Martha Larson, Louis ten Bosch, Cristian Tejedor-García
    • Reassessing Noise Augmentation Methods in the Context of Adversarial Speech
      Karla Pizzi, Matias Patricio Pizarro Bustamante, Asja Fischer
    11:15 - 11:40 Coffee Break
    11:40 - 12:10 2 SPSC video talks
    • Is Greek safer than English? Investigating the Influence of Language on Adversarial Audio Attacks
      Karla Pizzi
    • Voice Conversion-based Privacy through Adversarial Information Hiding
      Jacob J Webber, Oliver Watts, Gustav Eje Henter, Jennifer Williams, Simon King
    12:10 - 12:40 VPC Overview
    Natalia Tomashenko (session chair: Emmanuel Vincent)
    12:40 - 13:10 2 VPC video talks
    • T8 JHU CLSP: HLTCOE JHU Submission to the Voice Privacy Challenge 2024
      Henry Li Xinyuan, Zexin Cai, Ashi Garg, Kevin Duh, Leibny Paola García-Perera, Sanjeev Khudanpur, Nicholas Andrews, Matthew Wiesner
    • T9 LongYuan: Speaker anonymization system with sentiment transfer and feature interpolation
      Tao Tan, Shutao Liu, Yibo Duan, Sheng Zhao, Xi Shao
    13:10 - 14:00 Lunch Break (Buffet included)
    14:00 - 15:45 7 VPC video talks
    session chairs: Martha Larson and Candy Olivia Mawalim
    • T10 NPU-NTU: NPU-NTU System for Voice Privacy 2024 Challenge
      Jixun YAO, Nikita Kuzmin, Qing Wang, Pengcheng Guo, Ziqian Ning, Dake Guo, Kong Aik Lee, Eng-Siong Chng, Lei Xie
    • T12 NTU-NPU: NTU-NPU System for Voice Privacy 2024 Challenge
      Nikita Kuzmin, Hieu-Thi Luong, Jixun YAO, Lei Xie, Kong Aik Lee, Eng-Siong Chng
    • T33 KIT-ISL: Voice Privacy - Investigating Voice Conversion Architecture with Different Bottleneck Features
      Seymanur Akti, Tuan Nam Nguyen, Yining Liu, Alex Waibel
    • T17 NKU HLT Lab; PANO: Facodec Anonymization System Enhanced with Prosody Anonymization
      Jiabei He, Jiaming Zhou, Haoqin Sun, ****Hui Wang, Yong Qin
    • T7 Anemone: Emotional Speech Anonymization: Preserving Emotion Characteristics in Pseudo-speaker Speech Generation
      Hua Hua, Zengqiang Shang, Xuyuan Li, Peiyang Shi, Chen Yang, Li Wang, Pengyuan zhang
    • T18 Q: Emotion-Enhanced Speaker Anonymisation Using the FreeVC Framework
      Yuqi Li, Yuanzhong Zheng, Jingyi Fang, Jinming Chen
    • T30 V-Beam: Voice Anonymization Using Emotion-Enriched Feature Integration with STT and TTS Models
      Jeongae Lee, Taeje Park, Yeawon You
    15:45 - 16:45 Coffee break with Poster session
    • Safeguarding Speech Content Style: Enhancing Privacy Beyond Speaker Identity
      Yamini Sinha, Mykola Raivakhovskyi, Martha Schubert, Ingo Siegert
    • Towards Audiovisual Anonymization for Remote Psychotherapy: a Subjective Evaluation
      Carlos Franzreb, Arnab Das, Hannes Gieseler, Eva Charlotte Jahn, Tim Polzehl, Sebastian Möller
    • Enhancing Speech Privacy with LPC Modifications
      Jule Pohlhausen, Francesco Nespoli, Jörg Bitzer
    • Speecher: Towards Privacy Ensuring Decoder Only Speech Reconstruction Through Disentanglement for German Speech Anonymization Using Any-to-Many Voice Conversion
      Arnab Das, Carlos Franzreb, Suhita Ghosh, Tim Polzehl, Sebastian Möller
    • T38 Orange_Shiva: Tuning DISSC for Voice Privacy Challenge 2024
      Olivier Le Blouch, Rayane BAKARI, Nicolas Gengembre
    • T25 USTC-PolyU: A Voice Anonymization Method Based on Content and Non-content Disentanglement for Emotion Preservation
      Wenju Gu, Zeyan Liu, Liping Chen, Rui Wang, Chenyang Guo, Wu Guo, Kong Aik Lee, Zhen-Hua Ling
    • T19 DFKI_SLT: Comparing Speech Anonymization Efficacy by Voice Conversion Using KNN and Disentangled Speaker Feature Representations
      Arnab Das, Carlos Franzreb, Tim Herzig, Philipp Pirlet, Tim Polzehl
    • T14 ADRES: Exploring Vector-quantized Variational Auto-Encoder with Prosody Parameters for Speaker Anonymization
      Sotheara Leang, Anderson Augusma, Dominique Vaufreydaz, Eric Castelli, Sethserey Sam, Frédérique Letué
    16:45 - 17:00 SIG's SPSC Townhall Meeting
    19:00 - open Social Event for on-site participants


    All time are given with respect to the UTC+03:00 zone. You can use a time zone converter to check the times in your time zone.

    Dates

    May, 05

    Paper submission opens

    June, 22nd

    Long and short paper submission deadline and VoicePrivacy Challenge paper

    July, 22nd

    Acceptance Notification (challenge paper)

    July, 30th

    Acceptance Notification (long and short paper)

    August, 15th

    Final (camera ready) paper submission

    September, 6th

    Symposium

    Location

    The Symposium is held at Astir Odysseus.

    The Astir Odysseus Kos has four conference rooms with modern design, equipped with high-end technologies and as a result even the most demanding event can be organized. Our professional staff with years of experience will be by your side to offer expertise and advice always based on your desires.

    In the following map you can see the Symposiums Location (1) and the Interspeech Location (2).

    How to reach?

    by Car:

    It is approx 18 minutes driving distance from the Interspeech Kipriotis Hotels & Conference Center (KICC) to the Astir Odysseus hotel.

    public transport:

    According to google maps there is unfortunately now public transport available. So please try to share taxis to reach the destination.

    Organizing Committee


    Ingo SIEGERT, Otto von Guericke University Magdeburg, Germany

    Jennifer WILLIAMS, University of Southampton, UK

    Sneha DAS, Technical University of Denmark, Denmark

    Natalia TOMASHENKO, Inria, France

    Nick EVANS, EURECOM, France

    Program Committee


    (alphabetical)

    Ajinkya Kulkarni, Idiap Research Institute, Switzerland

    Brij Mohan Lal Srivastava, Nijta, France

    Candy Olivia Mawalim, Japan Advanced Institute of Science and Technology, Japan

    David Boyle, Imperial College London, UK

    Emmanuel Vincent, Inria, France

    Gerald Penn, University of Toronto, Canada

    Hemlata Tak, Pindrop, USA

    Ingo Siegert, Otto von Guericke University Magdeburg, Germany

    Jennifer Williams, University of Southampton, UK

    Junichi Yamagishi, National Institute of Informatics, Japan

    Korbinian Riedhammer, Nuremberg Institute of Technology, Germany

    Lin Zhang, National Institute of Informatics, Japan

    Md Sahidullah, TCG CREST & Academy of Scientific and Innovative Research (AcSIR), India

    Natalia Tomashenko, Inria, France

    Nick Evans, EURECOM, France

    Pierre Champion, Inria, France

    Sarina Meyer, University of Stuttgart, Germany

    Sebastian Le Maguer, University of Helsinki, Finland

    Simon King, University of Edinburgh, UK

    Sneha Das, Technical University of Denmark, Denmark

    Tim Polzehl, DFKI, Germany

    Tom Bäckström, Aalto University, Finland

    Xiaoxiao Miao, Singapore Institute of Technology, Singapore

    Xin Wang, National Institute of Informatics, Japan

    You (Neil) Zhang, University of Rochester, USA

    Ziqian Luo, Carnegie Mellon University, USA

    Sponsors

    logo_dark_10x