ASVSpoofing And Countermeasures Challenge 2019 題目解讀教育頭條網

ASVSpoofing And Countermeasures Challenge 2019 題目解讀

官網

https://www.asvspoof.org/官網大概的描述：

1、給了一個interspeech 2019會用得到的ppt
2、發佈了真實PA數據的時間
3、全部數據集發佈時間以及下載地址
4、ASRU 2019 Special Session的描述
5、關於ASVspoof 2019出版消息
6、ASVspoof 2019評價標準

ASVspoof 2019的註冊過程在評價標準裡面有寫，然後通過註冊後就可以獲得數據集的下載地址了。

ASVspoof 2019的時間表

1、19th December, 2018 Training and development data
2、8th February, 2019 Participant registration deadline
3、15th February, 2019 Evaluation data
4、22nd February, 2019 Scores submission
5、15th March, 2019 Results
6、29th March, 2019 Interspeech submission deadline
7、15th-19th September, 2019 ASVspoof 2019 special session at Interspeech 2019ASVspoof 2019的度量標準：t-DCF，然後附上了一篇論文。

ASVspoof 2019的baseline：又給了兩篇論文。大概就是這樣子。

ASVSpoof And Countermeasures Challenge 2019 評價標準解讀（很長，包括intro什麼的）

Introduction

先對ASVspoof 2013、2015、2017進行了總結。2013年提出了這個樣一個方向，

Data conditions: logical access

大概總結一下就是說VC和TTS技術做得越來越厲害了，我們要意識到這些技術帶來的問題，想要能夠做到用以前的vc方法得到的數據進行訓練，然後用先進的vc方法進行測試，看看能不能得到一個好的效果。然後整體講一下數據集怎麼來的：

1、基於一個多speaker的語音合成數據庫，叫做VCTK。

2、原始語音從107個speakers（46 male，61 female）的speech獲得，並且這些語音沒有significant channel或者background noise effects。

3、Spoofed 語音是使用一系列不同的spoofing算法對原始語音進行spoof得到的。

4、整個數據集被分為三份，training、development和evaluation。每一份數據中speaker都是不同的。

Training and development data

Training dataset:

-> genuine and spoofed speech from 20 speakers(8 male and 12 frmale)

-> spoofed utterance由2種VC方法和4種語音合成方法得到。

-> VC系統：

-> neural-network-based

-> transfer-function-based methods

-> 語音合成系統：

-> waveform concatenation

-> neural-network-based parametric speech synthesis using source-filter vocoders

-> neural-network-based parametric speech synthesis using Wavenet.

-> publicly available toolkits called Merlin, CURRENT and MaryTTS.

-> 訓練集種所有的數據都可以用來訓練detectors或者countermeasures，系統種所有的數據都是通過使用VCTK得到的。

Development dataset:

-> 和training dataset一樣，genuine and spoofed speech from 20 speakers(8 male and 12 frmale)

-> 得到spoofed utterance的方法和tringing中的一樣

-> development中得到spoofed utterance的方法和evaluation中得到的方法完全不同。

Evaluation data

-> 包括一系列沒見過的原始語音和spoofed speech

-> up-to-80k trails，大概有4G

-> 篡改方法不同於traing和development兩個過程的。

Performance measures

再性能評估上，以往兩屆是直接評估spoofing countermeasure（CMs），但是有個問題是可能再CM上表現好了，然而並不意味著對於能夠提升CM和ASV結合的系統的性能。因此ASVspoof 2019聚焦于衡量tandem系統的性能，在這個系統中，CM作為一個“gate”，先判斷語音是否為原始語音（nona fide）,然後再將其輸入到ASV系統中。tandem系統將考慮CM和ASV整體性能。Tandem detection cost function(t-DCF)作為首要的衡量因素。（t-DCF論文：t-DCF: a detection cost function for the tandem assessment of spooﬁng countermeasures and automatic speaker veriﬁcation）除了t-DCF之外還要評估EER。

分享到:

閱讀更多 從大數據到人工智能 的文章

關鍵字: 語音合成 2019你為誰驕傲技術