English Publications of Spoken Language Lab. (Since Apr. 1997)



2016

A.Morimoto, M.Niitsuma and Y.Yamashita
Age estimation in Japanese speech based on feature selection
5th Joint Meeting of the Acoustical Society of America and Acoustical Society of Japan, 5pSCb31 (2016.12).

Y.Yamamoto, M.Niitsuma and Y.Yamashita
Automatic recognition of negative emotion in speech using support vector machine
5th Joint Meeting of the Acoustical Society of America and Acoustical Society of Japan, 5aSC49 (2016.12).

C.Terayama, M.Niitsuma and Y.Yamashita
Development of a communication system using audio signal using audible frequency bands
5th Joint Meeting of the Acoustical Society of America and Acoustical Society of Japan, 2aSPb37 (2016.11).

2015

D.Khanh Ninh and Y.Yamashita
F0 Parameterization of Glottalized Tones in HMM-Based Speech Synthesis for Hanoi Vietnamese
IEICE Transactions on Information and Systems, E98-D, 12, pp.2280-2289 (2015.12).

D.Khanh Ninh and Y.Yamashita
F0 Parameterization of Glottalized Tones for HMM-Based Vietnamese TTS
Proc. of the Interspeech 2015 (Interspeech2015), pp.2202-2206 (2015.9).

2013

I.Sakamoto, K.Cho, M.Morise and Y.Yamashita
YLAB@RU at Spoken Term Detection Task in NTCIR10-SpokenDoc-2
Proc. of the 10th NTCIR Conference (NTCIR-10), pp.638-642 (2013.6).

D.Khanh Ninh, M.Morise and Y.Yamashita
A Generation Error Function Considering Dynamic Properties of Speech Parameters for Minimum Generation Error Training for Hidden Markov Model-based Speech Synthesis
Acoustical Science and Technology, 34, 2, pp.123-132 (2013.3).

M.Nakayama, T.Nishiura, Y.Yamashita and N.Nakasako
Multiple-nulls-steering Beamformer Based on Both Talker and Noise Direction-of-arrival Estimation
Acoustical Science and Technology, 34, 2, pp.80-88 (2013.3).

Y.Yamashita
A Review of Paralinguistic Information Processing for Natural Speech Communication
Acoustical Science and Technology, 34, 2, pp.73-79 (2013.3).

2012

D.Khanh Ninh, M.Morise and Y.Yamashita
Incorporating Dynamic Features into Minimum Generation Error Training for HMM-Based Speech Synthesis
Proc. of the 8th International Symposium on Chinese Spoken Language Processing (ISCSLP2012), PaperID:O3.2, pp.55-59 (2012.12).

D.Khanh Ninh, M.Morise and Y.Yamashita
An adaptive weighting approach for minimum generation error training considering dynamic features in HMM-based speech synthesis
Proc. of 2012 Autumn Meeting of Acoustical Society of Japan, 3-Q-11, pp.383-386 (2012.9).

D.Khanh Ninh, K.Cho and Y.Yamashita
Introduction of duration models and dynamic features in MGE training for HSMM-based speech synthesis
Proc. of 2012 Spring Meeting of Acoustical Society of Japan, 1-R-3, pp.431-434 (2012.3).

2011

Y.Yamashita, T.Matsunaga and K.Cho
YLAB@RU at Spoken Term Detection Task in NTCIR9-SpokenDoc
Proc. of the 9th NTCIR Workshop Meeting (NTCIR-9), SpokenDoc10, pp.287-290 (2011.12).

T.Fukumori, M.Morise, T.Nishiura, Y.Yamashita and H.Nanjo
The estimation of optimum subtraction parameters for iterative spectral subtraction towards musical tone reduction
Proc. of Internoise2011 (Internoise2011), PaperID:Mon-P-21 (2011.9).

M.Nakayama, T.Nishiura, Y.Yamashita and N.Nakasako
Parallel beamformer based on both talkers and noises localization
Proc. of Internoise2011 (Internoise2011), PaperID:Mon-P-19 (2011.9).

K.Horii, T.Fukumori, M.Morise, T.Nishiura and Y.Yamashita
Musical tone reduction based on auditory sense for spectral subtraction
Proc. of Internoise2011 (Internoise2011), PaperID:Mon-P-22 (2011.9).

2010

K.Cho, T.Nishiura and Y.Yamashita
Robust Speaker Localization in a Disturbance Noise Environment Using a Distributed Microphone System
Proc. of the 7th International Symposium on Chinese Spoken Language Processing (ISCSLP2010), PaperID:P1.6, pp.1-5 (2010.11).

Y.Itoh, H.Nishizaki, X.Hu, H.Nanjo, T.Akiba, T.Kawahara, S.Nakagawa, T.Matsui, Y.Yamashita and K.Aikawa
Constructing Japanese Test Collections for Spoken Term Detection
Proc. of INTERSPEECH 2010 (INTERSPEECH2010), pp.677-680 (2010.9).

A.Yamamoto, K.Suzuki, K.Cho and Y.Yamashita
Automatic Prosodic Labeling of Accent Information for Japanese Spoken Sentences
Proc. of the 7th ISCA Tutorial and Research Workshop on Speech Synthesis (SSW7), P-2.4, pp.300-305 (2010.9).

K.Cho, H.Okumura, T.Nishiura and Y.Yamashita
Multiple Sound Source Localization Based on Inter-Channel Correlation Using a Distributed Microphone System in a Real Environment
IEICE Transactions on Information and Systems, E93-D, 9, pp.2463-2471 (2010.9).

K.Cho, T.Nishiura and Y.Yamashita
Localization of Multiple Sound Sources Based on Subtraction of Accumulated Inter-channel Correlation
Proc. of the 20th International Congress on Acoustics (ICA2010), PaperID:27, pp.1-4 (2010.8).

K.Hayashida, Y.Mizoguchi, J.Ogawa, M.Morise, T.Nishiura and Y.Yamashita
The Acoustic Sound Field Dictation with Hidden Markov Model Based on an Onomatopoeia
Proc. of the 20th International Congress on Acoustics (ICA2010), PaperID:171, pp.1-5 (2010.8).

2009

K.Katsurada, A.Lee, T.Kawahara, T.Yotsukura, S.Morishima, T.Nishimoto, Y.Yamashita and T.Nitta
Development of a Toolkit for Spoken Dialog Systems with an Anthropomorphic Agent: GALATEA
Proc. of 2009 APSIPA Annual Summit and Conference (APSIPA ASC 2009), MP-SS1-5 (2009.10).

T.Akiba, K.Aikawa, Y.Itoh, T.Kawahara, H.Nanjo, H.Nishizaki, N.Yasuda, Y.Yamashita and K.Itou
Developing an SDR Test Collection from Japanese Lecture Audio Data
Proc. of 2009 APSIPA Annual Summit and Conference (APSIPA ASC 2009), TA-SS1-2 (2009.10).

K.Cho, T.Nishiura and Y.Yamashita
A Study on Multiple Sound Source Localization with a Distributed Microphone System
Proc. of the Interspeech 2009 (Interspeech2009), pp.1359-1362 (2009.9).

T.Akiba, K.Aikawa, Y.Itoh, T.Kawahara, H.Nanjo, H.Nishizaki, N.Yasuda, Y.Yamashita and K.Itou
Construction of a Test Collection for Spoken Document Retrieval from Lecture Audio Data
IPSJ Journal, 50, 2, pp.501-513 (2009.2).

2008

K.Cho, H.Okumura, T.Nishiura and Y.Yamashita
Localization of Multiple Sound Sources Based on Inter-Channel Correlation Using a Distributed Microphone System
Proc. of the Interspeech 2008 (Interspeech2008), pp.443-446 (2008.9).

T.Akiba, K.Aikawa, Y.Itoh, T.Kawahara, H.Nanjo, H.Nishizaki, N.Yasuda, Y.Yamashita and K.Itou
Test Collections for Spoken Document Retrieval from Lecture Audio Data
Proc. of International Conference on Language Resources and Evaluation (LREC2008) (2008.5).

Y.Denda, T.Nishiura and Y.Yamashita
Omnidirectional Audio-Visual Talker Localization Based on Dynamic Fusion of Audio-Visual Features Using Validity and Reliability Criteria
IEICE Transactions on Information and Systems, E91-D, 3, pp.598-606 (2008.3).

2007

K.Cho, T.Nishiura and Y.Yamashita
3-Dimensional Sound Source Localization Using a Distributed Microphone System
Proc. of the 19th International Congress on Acoustics (ICA2007), CAS-04-007, pp.1-6 (2007.9).

Y.Denda, T.Nishiura and Y.Yamashita
Omnidirectional Audio-Visual Talker Localization with Dynamic Feature Fusion based on Validity and Reliability Criteria
Proc. of the 8th Annual Conference of the International Speech Communication Association (Interspeech2007), WeB.P1b-7, pp.726-729 (2007.8).

Y.Denda, T.Tanaka, M.Nakayama, T.Nishiura and Y.Yamashita
Noise-Robust Hands-free Voice Activity Detection with Adaptive Zero Crossing Detection using Talker Direction Estimation
Proc. of the 8th Annual Conference of the International Speech Communication Association (Interspeech2007), TuC.P3a-1, pp.222-225 (2007.8).

2006

Y.Denda, T.Nishiura and Y.Yamashita
A Study of Robust Omnidirectional Audio-Visual Talker Localization Algorithm with Microphone Array and Omnidirectional Image
4th Joint Meeting of the ASA and the ASJ, 1pSC24 (2006.11).

M.Nakayama, T.Nishiura and Y.Yamashita
A Design of Fast Steering Filters Based on the Adaptive Fusion of Predesigned Finite Impulse Response Filters for Microphone Array
4th Joint Meeting of the ASA and the ASJ, 3pSP2 (2006.11).

K.Cho, H.Okumura, T.Nishiura and Y.Yamashita
Sound Source Localization Using a Distributed Microphone System in Real Environments
4th Joint Meeting of the ASA and the ASJ, 3pSP42 (2006.11).

Y.Denda, T.Nishiura and Y.Yamashita
A Design of Robust Omnidirectional Audio-Visual Talker Localizer
Proc. of IASTED Internet and Multimedia Systems and Applications (IMSA2006), pp.210-215 (2006.8).

Y.Denda, T.Nishiura and Y.Yamashita
Robust Talker Localization Algorithm Based on Weighted CSP Analysis and Maximum Likelihood Estimation
The 9th Western Pacific Acoustics Conference (WESPAC IX 2006) (2006.6).

M.Nakayama, T.Nishiura and Y.Yamashita
Hands-Free Speech Recognition With Average Phoneme-based AMNOR
The 9th Western Pacific Acoustics Conference (WESPAC IX 2006) (2006.6).

Y.Denda, T.Nishiura and Y.Yamashita
Robust Talker Direction Estimation Based on Weighted CSP Analysis and Maximum Likelihood Estimation
IEICE Transactions on Information and Systems, E89-D, 3, pp.1050-1057 (2006.3).

2005

Y.Yamashita
Concept-to-Speech Conversion System for Spoken Dialogue Systems
in Spoken Language Systems, eds. S.Nakagawa, M.Okada, and T.Kawahara, Ohmsha, pp.101-112 (2005.10).

Y.Denda, T.Nishiura and Y.Yamashita
A Study of Weighted CSP Analysis with Average Speech Spectrum for Noise Robust Talker Localization
Proc. of 9th European Conference on Speech Communication and Technology (Interspeech2005), pp.2321-2324 (2005.9).

Y.Denda, T.Nishiura and Y.Yamashita
Noise Robust Talker Localization Based on Weighted CSP Analysis With an Average Speech Spectrum for Microphone Array Steering
Proc. of International Workshop on Acoustic Echo and Noise Control (IWAENC2005), pp.165-168 (2005.9).

K.Cho, T.Ichimaru and Y.Yamashita
Speech Recognition Using Inter-Phoneme Dependency Based on a Speaker Space Model
Systems and Computers in Japan, 36, 8, pp.15-22 (2005.7).

Y.Yamashita, K.Kato and K.Nozawa
Automatic Scoring for Prosodic Proficiency of English Sentences Spoken by Japanese Based on Utterance Comparison
IEICE Transactions on Information and Systems, E88-D, 3, pp.496-501 (2005.3).

2004

K.Cho and Y.Yamashita
Determination of the Number of Candidates Using Recognition Scores for N-best Based Speech Interface
Proc. of the 6th IASTED International Conference on Signal and Image Processing (SIP2004), pp.268-272 (2004.8).

K.Cho and Y.Yamashita
Speech Recognition Using Inter-phoneme Dependency Based on a Speaker Space Model
Proc. of the 18th International Congress on Acoustics (ICA2004), 5, pp.3507-3510 (2004.4).

A.Inoue, T.Mikami and Y.Yamashita
Improvement of Speech Summarization Using Prosodic Information
Proc. of Speech Prosody 2004 (SP2004), pp.599-602 (2004.3).

M.Fukui, Y.Yamashita, H.Saruwatari and K.Shikano
Pitch Recognition for Successive Musical Chords Using Short-Term Estimation Method for Peak Frequencies
Proc. International Symposium on Musical Acoustics (ISMA2004) (2004.3).

2003

A.Inoue, T.Mikami and Y.Yamashita
Prediction of Sentence Importance for Speech Summarization Using Prosodic Parameters
Proc. of 8th European Conference on Speech Communication and Technology (Eurospeech 2003), 1, pp.1193-1196 (2003.9).

A.Inoue and Y.Yamashita
Speech Summarization of Lecture Speech Using F0 Parameters
Proc. of the Eighth Western Pacific Acoustics Conference (WESPAC8), TB45 (2003.4).

K.Cho and Y.Yamashita
Speech Recognition Using Inter-Phoneme Dependency
Proc. of the Eighth Western Pacific Acoustics Conference (WESPAC8), MB32 (2003.4).

Y.Yamashita, T.Ishida and K.Shimadera
A Stochastic F0 Contour Model Based on Clustering and a Probabilistic Measure
IEICE Transactions on Information and Systems, E86-D, 3, pp.543-549 (2003.3).

A.Inoue and Y.Yamashita
Extraction of important sentences for speech summarization based on an F0 model
The Journal of the Acoustical Society of Japan (E), 24, 1, pp.35-37 (2003.1).

2002

Y.Yamashita and A.Inoue
Extraction of Important Sentences Using F0 Information for Speech Summarization
Proc. of 7th International Conference on Spoken Language Processing (ICSLP2002), 2, pp.1181-1184 (2002.9).

2001

Y.Yamashita and T.Ishida
Stochastic F0 Contour Model Based on the Clustering of F0 Shapes of a Syntactic Unit
Proc. of 7th European Conference on Speech Communication and Technology (Eurospeech 2001), 1, pp.533-536 (2001.9).

Y.Yamashita, D.Iwahashi and R.Mizoguchi
Keyword Spotting Using F0 Contour Information
Systems and Computers in Japan, 32, 7, pp.52-61 (2001.7).

2000

Y.Yamashita and M.Murai
An Annotation Scheme of Spoken Dialogues with Topic Break Indexes
Proc. of 6th International Conference on Spoken Language Processing (ICSLP2000), 1, pp.569-572 (2000.10).

1999

Y.Yamashita
Prediction of Keyword Spotting Accuracy Based on Simulation
Proc. of 6th European Conference on Speech Communication and Technology (Eurospeech '99), 3, pp.1235-1238 (1999.9).

A.Ichikawa, M.Araki, Y.Horiuchi, M.Ishizaki, S.Itabashi, T.Itoh, H.Kashioka, K.Kato, H.Kikuchi, H.Koiso, T.Kumagai, A.Kurematsu, K.Maekawa, S.Nakazato, M.Tamoto, S.Tutiya, Y.Yamashita and T.Yoshimura
Evaluation of Annotation Schemes for Japanese Discourse
Proc. of ACL '99 Workshop on Towards Standards and Tools for Discourse Tagging (ACL-WS '99), pp.26-34 (1999.6).

1998

Y.Yamashita, T.Tsunekawa and R.Mizoguchi
Topic Recognition for News Speech Based on Keyword Spotting
Proc. of 5th International Conference on Spoken Language Processing (ICSLP '98), 3, pp.839-842 (1998.12).

A.Ichikawa, M.Araki, M.Ishizaki, S.Itabashi, T.Itoh, H.Kashioka, K.Kato, H.Kikuchi, T.Kumagai, A.Kurematsu, H.Koiso, M.Tamoto, S.Tutiya, S.Nakazato, Y.Horiuchi, K.Maekawa, Y.Yamashita and T.Yoshimura
Standardising Annotation Schemes for Japanese Discourse
Proc. of First International Conference on Language Resources and Evaluation (LREC '98), pp.731-736 (1998.5).

1997

Y.Yamashita and R.Mizoguchi
Keyword Spotting Using F0 Contour Matching
Proc. of 5th European Conference on Speech Communication and Technology (Eurospeech '97), 1, pp.271-274 (1997.9).