早稲田大学 知覚情報システム・メディアインテリジェンス研究室

アーカイブ 2023年

学術論文

2024/01

David Wei Dai, Shungo Suzuki, Chen Guanliang, “Generative AI for professional communication training in intercultural contexts: Where we are and where we are heading,” Applied Linguistics Review, 2024. (To appear)

2023/12

Yusuke Fujita, Tetsuji Ogawa, Tetsunori Kobayashi, “Self-conditioning via intermediate predictions for end-to-end neural speaker dialization,” IEEE Access, vol. 11, pp. 140069-140076, Dec. 2023. [DOI]

国際会議

2024/01

Fuma Kurata, Mao Saeki, Masaki Eguchi, Shungo Suzuki, Hiroaki Takatsu, Yoichi Matsuyama, “Development and Validation of Engagement and Rapport Scales for Evaluating User Experience in Multimodal Dialogue Systems”, Proc. 14th International Workshop on Spoken Dialogue Systems Technology (IWSDS2024), March 2024. (to appear)

2024/01

Tomoki Ariga, Yosuke Higuchi, Kazutoshi Hayasaka, Naoki Okamoto, Tetsuji Ogawa, “Parody detection using source-target attention with teacher-forced lyrics,” Proc. 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2024), April 2024. (to appear)

2023/12

Kohei Saijo, Wangyou Zhang, Zhong-Qiu Wang, Shinji Watanabe, Tetsunori Kobayashi, Tetsuji Ogawa, “A single speech enhancement model unifying dereverberation, denoising, speaker counting, separation, and extraction,” Proc. 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU2023), pp.XXX-XXX, Dec. 2023.

2023/12

Taiki Inoue, Jun Ogata, Makoto Iida, Tetsuji Ogawa, “Learning discriminative feature representation via metric learning for early operation of wind turbine anomaly detection systems,” Proc. 22nd International Conference on Machine Learning and Applications (ICMLA2023), pp.1292-1297, Dec. 2023.

2023/11

Ahmed Hammad Azab, Ahmed Bayoumi Zaki, Tetsuji Ogawa, Walid Gomaa, “Masry: A text-to-speech system for the Egyptian Arabic,” Proc. 20th International Conference on Informatics in Control, Automation, and Robotics (ICINCO2023), pp.219-226, Nov. 2023. [DOI]

2023/09

Rami Zewail, Jacob Leonard, Tetsuji Ogawa, Samir El-Sagheer, “Lightweight multiscale attention-aware method for semantic segmentation of urban structural buildings in drone aerial imagery,” Proc. 2023 International Mobile, Intelligent, and Ubiquitous Computing Conference (MIUCC2023), pp.338-344, Sept. 2023. [DOI]

2023/09

Yuta Ide, Naohiro Tawara, Susumu Saito, Teppei Nakano, Tetsuji Ogawa, “Voice or Content? — Exploring impact of speech content on age estimation from voice,” Proc. the 31st European Signal Processing Conference (EUSIPCO2023), pp.221-225, Sept. 2023. [DOI]

2023/09

Tomoki Ariga, Yosuke Higuchi, Mitsunori Kanno, Rie Shigyo, Takato Mizuguchi, Naoki Okamoto, Tetsuji Ogawa, “Spotting parodies: Detecting alignment collapse between lyrics and singing voice,” Proc. the 31st European Signal Processing Conference (EUSIPCO2023), pp.286-290, Sept. 2023. [DOI]

2023/09

Huaibo Zhao, Yosuke Higuchi, Yusuke Kida, Tetsuji Ogawa, Tetsunori Kobayashi, “Mask-CTC-based encoder pre-training for streaming end-to-end speech recognition,” Proc. the 31st European Signal Processing Conference (EUSIPCO2023), pp.56-60, Sept. 2023. [DOI]

2023/08

Jin Sakuma, Shinya Fujie, Huaibo Zhao, Tetsunori Kobayashi, “Improving the response timing estimation for spoken dialogue systems by reducing the effect of speech recognition delay,” Proc. The 24th Annual Conference of the International Speech Communication Association (INTERSPEECH2023), pp.2668-2672, Aug. 2023. [DOI]

2023/08

Fuma Kurata, Mao Saeki, Shinya Fujie, Yoichi Matsuyama, “Multimodal turn-taking model using visual cues for end-of-utterance prediction in spoken dialogue systems,” Proc. The 24th Annual Conference of the International Speech Communication Association (INTERSPEECH2023), pp.2658-2662, Aug. 2023. [DOI] [ISCA Best Student Paper Award 2023]

2023/08

Kohei Saijo, Tetsuji Ogawa, “Remixing-based unsupervised source separation from scratch,” Proc. The 24th Annual Conference of the International Speech Communication Association (INTERSPEECH2023), pp.1678-1682, Aug. 2023. [DOI] [Scopus]

2023/07

Ryuki Matsuura, Shungo Suzuki, “Prompt-independent automated scoring of L2 oral fluency by capturing prompt effects,” Proc. 24th International Conference on Artificial Intelligence in Education (AIED2023), pp.720-726, July 2023. [DOI]

2023/06

Fatma Youssef, Ahmed El-Mahdy, Tetsuji Ogawa, Walid Gomaa, “Thermal gait dataset for deep learning-oriented gait recognition,” Proc. 2023 International Joint Conference on Neural Networks (IJCNN2023), pp.1-8, June 2023. [DOI] [Scopus]

2023/06

Haruki Konii, Teppei Nakano, Yasumasa Miyazawa, Tetsuji Ogawa, “Narrow down forecast range: Using knowledge of past operations and attribute-dependent thresholding in good fishing ground prediction,” Proc. MTS/IEEE OCEANS 2023 Limerick Conference and Exhibit (OCEANS2023), June 2023. [DOI]

2023/06

Yusuke Fujita, Tatsuya Komatsu, Robin Scheibler, Yusuke Kida, Tetsuji Ogawa, “Neural diarization with non-autoregressive intermediate attractor,” Proc. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2023), June 2023. [DOI]

2023/06

Yosuke Higuchi, Tetsuji Ogawa, Tetsunori Kobayashi, Shinji Watanabe, “InterMPL: Momentum pseudo-labeling with intermediate CTC loss,” Proc. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2023), June 2023. [DOI]

2023/06

Yosuke Higuchi, Tetsuji Ogawa, Tetsunori Kobayashi, Shinji Watanabe, “BECTRA: Transducer-based end-to-end ASR with BERT-enhanced encoder,” Proc. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2023), June 2023. [DOI]

2023/06

Kohei Saijo, Tetsuji Ogawa, “Self-Remixing: Unsupervised speech separation via separation and remixing,” Proc. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2023), June 2023. [DOI]

2023/06

Huaibo Zhao, Shinya Fujie, Tetsuji Ogawa, Jin Sakuma, Yusuke Kida, Tetsunori Kobayashi, “Conversation-oriented ASR with multi-look-ahead CBS architecture,” Proc. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2023), June 2023. [DOI]

研究会・シンポジウム

2023/12

倉田楓真,佐伯真於,江口政貴,鈴木駿吾,高津弘明,松山洋一,“対話体験品質評価手法の検討:ロールプレイと議論対話におけるエンゲージメントとラポールの分析,” 第14回対話システムシンポジウム,Dec. 2023.

2023/12

佐伯真於,高津弘明,倉田楓真,鈴木駿吾,江口政貴,松山洋一,“LANGX Speaking 会話エージェントによる英会話能力判定システム,” 第14回対話システムシンポジウム,Dec. 2023.

2023/11

井上太揮,緒方淳,飯田誠,小川哲司,“風車異常検知システム早期運用のための距離学習を用いた識別的な特徴表現の学習,” 第45回風力エネルギー利用シンポジウム,Nov. 2023. [優秀発表賞]

2023/11

若山拓矢,井上太揮,緒方淳,飯田誠,小川哲司,“M-measureを用いた特徴抽出に基づく回転速度に頑健な風車異常検知,” 第45回風力エネルギー利用シンポジウム,Nov. 2023.

2023/07

望田康太,中野鐵兵,藤江真也,若林麻里,佐藤朝美,小川哲司,“重症心身障害児を対象とした顔表情に基づく感情状態推定のための事前学習モデルに関する検討,” 第26回画像の認識・理解シンポジウム (MIRU2023),IS1-104,pp.1-4,July 2023.

2023/07

中田道寛,斎藤奨,中野鐵兵,小川哲司,“映像監視に基づく意思決定支援のための事前学習モデルの構築法と繁殖牛の分娩検知への応用,” 第26回画像の認識・理解シンポジウム (MIRU2023),IS1-101,pp.1-4,July 2023.

2023/06

有賀智輝,樋口陽祐,菅野光則,執行里恵,水口天都,岡本直紀,小川哲司,“歌詞と歌唱音声のアライメント崩れに基づく替え歌検知,” 電子情報通信学科技術研究報告(SP),vol.123,no.88,SP2023-10,pp.48-53,June 2023.

全国大会

2024/01

菅野竜雅,佐藤裕明,熊野正,河合吉彦,小川哲司,“発音出力を利用したchain of thought 音声認識,” 日本音響学会研究発表会講演論文集,March 2024. (to appear)

2024/01

楠奈穂美,樋口陽祐,小川哲司,小林哲則,“再帰的フィードバックを用いた階層的マルチタスク学習によるEnd-to-End音声認識,” 日本音響学会研究発表会講演論文集,March 2024. (to appear)

2023/09

当間佐耶佳,有賀智輝,樋口陽祐,早坂一寿,岡本直紀,小川哲司,“深層話者埋め込みを用いた歌唱者の照合に関する検討,” 日本音響学会研究発表会講演論文集,Sept. 2023.

2023/09

佐藤裕明,菅野竜雅,佐久間旭,河合吉彦,熊野正,山田一郎,小川哲司,“Streaming transducerにおけるテキストのみを用いた学習方法に関する検討,” 日本音響学会研究発表会講演論文集,Sept. 2023.

2023/09

西城耕平,小川哲司,“音源の分離と再混合による事前学習を必要としないモノラル教師なし音源分離,” 日本音響学会研究発表会講演論文集,Sept. 2023.

2023/09

謝佳臻,藤江真也,小林哲則,“情報伝達のための音声合成における発話文の役割情報付与手法の検討,” 日本音響学会秋季研究発表会講演論文集,Sept. 2023.

2023/09

藤江真也,小林哲則,“非流暢現象ラベル付き発音形認識モデルとテキスト変換モデルを組み合わせた音声認識システム,” 日本音響学会秋季研究発表会講演論文集,Sept. 2023.

2023/09

谷口友紀,藤江真也,小坂直敏,小林哲則,“発話タイミング推定における時間心理尺度の考慮,” 日本音響学会秋季研究発表会講演論文集,Sept. 2023.

2023/09

樋口陽祐,小川哲司,小林哲則,渡部晋治,“事前学習済みマスク言語モデルを用いたEnd-to-end音声認識,” 日本音響学会研究発表会講演論文集,Sept. 2023.

2023/09

Huaibo Zhao, Shinya Fujie, Tetsuji Ogawa, Tetsunori Kobayashi, “An investigation on constructing multi-look-ahead contextual block streaming transducer,” 日本音響学会研究発表会講演論文集,Sept. 2023.

2023/09

有賀智輝,樋口陽祐,早坂一寿,岡本直紀,小林哲則,小川哲司,“Teacher-Forcingにより歌詞を与えた際のAttentionの崩れに着目した替え歌検知,” 日本音響学会研究発表会講演論文集,Sept. 2023.

2023/08

菅野竜雅,佐藤裕明,佐久間旭,熊野正,河合吉彦,山田一郎,小川哲司,“字幕制作効率化のための音声認識エラー検出手法,” 映像メディア学会2023年年次大会,Aug. 2023.

2023/07

望田康太,岸凌祐,大矢耀介,中野鐵兵,藤江真也,佐藤朝美,小川哲司,“アクションユニットを用いた重症心身障害児の感情状態推定,” 第24回日本医療情報学会看護学術大会,pp.139-140,July 2023.

© 2015 Perceptual Computing Group, Waseda University. All Rights Reserved

page-archive-2023