CrestMuse Project

Publications

	2005		2006		2007		2008		2009		2010

Peer-reviewed Journal Papers

Katsutoshi Itoyama, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Parameter Estimation for Harmonic and Inharmonic Models by Using Timbre Feature Distributions IPSJ Journal, Vol.50, No.7 (Jul. 2009) 1757-1767, IPSJ. Journal of Information Processing, Vol.17 (2009) 191-201, IPSJ.
Osamu Fujimura, Kiyoshi Honda, Hideki Kawahara, Yasuyuki Konparu and Masanori Morise, Noh Voice Quality, J. Logopedics Phoniatrics Vocology, Vol.34(4), pp.157-170, 2009. DOI: 10.1080/14015430903002288
Hiromasa Fujihara, Masataka Goto, Tetsuro Kitahara Hiroshi G. Okuno: A Modeling of Singing Voice Robust to Accompaniment Soundｓ and Its Application to Singer Identification and Vocal-Timbre-Similarity-Based Music Information Retrieval, IEEE Transactions on Audio, Speech and Language Processing, Vol.18, No.3 (March 2010) 638-648, IEEE. doi: 10.1109/TASL.2010.2041386
Tetsuro Kitahara: "Mid-level Representations of Musical Audio Signals for Music Information Retrieval", Advances in Music Information Retrieval, Springer, Volume 274，2010，ｐｐ．65-91，2010.3. doi：10.1007/978-3-642-11674-2_4

Book Chapters

Stanislaw Andrzej Raczynski, Nobutaka Ono, Shigeki Sagayama, “Extending Nonnegative Matrix Factorization--a discussion in the context of mulipitch frequency estimation of musical signals,” Proc. of EUSIPCO, pp.934-938, Aug., 2009.
Nobutaka Ono, Kenichi Miyamoto, Hirokazu Kameoka, Jonathan Le Roux, Yuuki Uchiyama, Emiru Tsunoo, Takuya Nishimoto, Shigeki Sagayama, “Harmonic and Percussive Sound Separation and its Application to MIR-related Tasks,” Advances in Music Information Retrieval, ser. Studies in Computational Intelligence, Z. W. Ras and A. Wieczorkowska, Eds. Springer, 274, pp.213-236, Feb., 2010.

Peer-reviewed Conference Papers

H. Kawahara, R. Nisimura, T. Irino, M. Morise, T. Takahashi, B. Banno, Temporally variable multi-aspect auditory morphing enabling extrapolation without objective and perceptual breakdown, Proc. ICASSP, Taipei, Taiwan, 19-24 (2009). (2009.4, Taipei)
Emiru Tsunoo, Nobutaka Ono, Shigeki Sagayama, “Rhythm Map: Extraction of Unit Rhythmic Patterns and Analysis of Rhythmic Structure from Music Acoustic Signals,” Proc. of ICASSP, pp.185-188, Apr., 2009.
Emiru Tsunoo, George Tzanetakis, Nobutaka Ono, Shigeki Sagayama, “Audio Genre Classification Using Percussive Pattern Clustering Combined with Timbral Features,” International Conference on Multimedia and Expo, pp.382-385, Jun., 2009.
Ryo Kanda, Mitsuyo Hashida and Haruhiro Katayose, Mims: Interactive Multimedia Live Performance System . Proc. NIME 2009 (June 2009)
Otsuka, T., Hosokawa, T., Kazai, K. & Katayose, H. Concealed information test of simultaneously recording with hemodynamic responses and autonomic responses：The 8th Annual Meeting of the Society for Applied Research in Memory and Cognition, Kyoto, Japan. 2009.07.29.
Masataka Goto: Augmented Music-Understanding Interfaces, The 6th Sound and Music Computing Conference (SMC 2009): Inspirational Session, July 2009.
Mitsuyo Hashida and Haruhiro Katayose: Mixtract: A Directable Musical Expression System, Proc. of Affective Computing and Intelligent Interaction (ACII) 2009, pp.xxx-xxx, 2009
Tetsuro Kitahara, Naoyuki Totani, Ryosuke Tokuami, and Haruhiro Katayose: "BayesianBand: Jam Session System based on Mutual Prediction by User and System", Entertainment Computing: Proceedings of the 10th International Conference on Entertainment Computing (ICEC 2009), pp.179--184, September 2009.
M. Morise, M. Onishi, H. Kawahara and H. Katayose，"v.morish'09: A morphing-based singing design interface for vocal melodies, '' Lecture Note in Computer Science, LNCS 5709 (in Proc of ICEC 2009), pp.185-190, Sept. 2009.
Kazuyoshi Yoshii and Masataka Goto: MusicCommentator: Generating Comments Synchronized with Musical Audio Signals by a Joint Probabilistic Model of Acoustic and Textual Features, Proceedings of the 8th International Conference on Entertainment Computing (ICEC 2009) (Lecture Notes in Computer Science), pp.85-97, September 2009.
M. Morise, H. Kawahara and T. Nishiura: Rapid F0 estimation for high-SNR speech, Proc, WESPAC2009, Beijing, China, CD-ROM, Sept. 21-23, Beijing, 2009
H. Kawahara, T. Takahashi, M. Morise and H. Banno, ``Development of exploratory research tools based on TANDEM-STRAIGHT,'' Proc, APSIPA 2009, pp.111-120, Sapporo, Oct. 4-7, 2009. (2009.10.5, Sapporo)
Kazuyoshi Yoshii and Masataka Goto: Continuous pLSI and Smoothing Techniques for Hybrid Music Recommendation, Proceedings of the 10th International Society for Music Information Retrieval Conference (ISMIR 2009), pp.339-344, October 2009.
H. Itagaki, M. Morise, T. Nisimura, T. Irino and H. Kawahara: A bottom-up procedure to extract periodicity structure of voiced sounds and its application to represent and restoration of pathological voices, MAVEBA09, Firenze Italy, 2009.12
Katsutoshi Itoyama, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: QUERY-BY-EXAMPLE MUSIC RETRIEVAL APPROACH BASED ON MUSICAL GENRE SHIFT BY CHANGING INSTRUMENT VOLUME, Proceeding of the 12th International Conference on Digital Audio Effects (DAFx-09), accepted, Como, Italy, Sep.1-4. 2009.
Naoki Yasuraoka, Takehiro Abe, Katsutoshi Itoyama, Kazuyoshi Yoshii, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Changing Timbre and Phrase in Existing Musical Performances as You Like, ACM Multimedia 2009, 203-212, Beijing, China, Oct. 19-24, 2009. doi:10.1145/1631272.1631302.
Hiromasa Fujihara, Masataka Goto, Hiroshi G. Okuno: A NOVEL FRAMEWORK FOR RECOGNIZING PHONEMES OF SINGING VOICE IN POLYPHONIC MUSIC, Proceedings of 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2009), Oct. 18-21, New Paltz, NY, 2009.
Satoru Fukayama, Kei Nakatsuma, Shinji Sako, Yu-ichiro Yonebayashi, Tae Hun Kim, Qin Si Wei, Takuho Nakano, Takuya Nishimoto, Shigeki Sagayama, “Orpheus: Automatic Composition System Considering Prosody of Japanese Lyrics,” Entertainment Computing - ICEC 2009, pp.309-310, Sep., 2009.
Stanislaw Andrzej Raczynski, Nobutaka Ono, Shigeki Sagayama, “Note detection with dynamic Bayesian networks as a postanalysis step for NMF-based multiple pitch estimation techniques,” Proc. WASPAA, pp.49-52, Oct., 2009.
Emiru Tsunoo, Nobutaka Ono, Shigeki Sagayama, “Musical Bass-Line Pattern Clustering and Its Application to Audio Genre Classification,” Proc. of ISMIR, pp.219-224, Oct., 2009.
Jeremy Reed, Yushi Ueda, Sabato Marco Siniscalchi, Yuki Uchiyama, Shigeki Sagayama, Chin-Hui Lee, “Minimum Classification Error Training to Improve Isolated Chord Recognition,” Proc. of ISMIR, pp.609-614, Oct., 2009.
Akira Maezawa, Katsutoshi Itoyama, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Bowed String Sequence Estimation of a Violin Based on Adaptive Audio Signal Classification and Context-Dependent Error Correction, Proceedings of IEEE International Symposium on Multimedia (ISM2009), accepted for full paper presentation (acceptance rate for full papers, 19.6%), San Diego, Dec. 14-16, 2009.
Takuma Ohtsuka, Kazuhiro Nakadai, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Voice quality manipulation for humanoid robots consistent with their head movements, Proceedings of IEEE-RAS Interanational Conference on Humanoid Robots (Humanoids 2008), pp.405-410, IEEE, Paris, Dec. 7-10, 2009.
Yushi Ueda, Yuuki Uchiyama, Takuya Nishimoto, Nobutaka Ono, Shigeki Sagayama, “HMM-based Approach for Automatic Chord Detection using Refined Acoustic Features,” Proc. of ICASSP, Mar., 2010.
Emiru Tsunoo, Taichi Akase, Nobutaka Ono, Shigeki Sagayama, “Music Mood Classification by Rhythm and Bass-line Unit Pattern Analysis,” Proc. of ICASSP, pp.265-268, Mar., 2010.
Hideyuki Tachibana, Takuma Ono, Nobutaka Ono, Shigeki Sagayama, “MELODY LINE ESTIMATION IN HOMOPHONIC MUSIC AUDIO SIGNALS BASED ON TEMPORAL-VARIABILITY OF MELODIC SOURCE,” Proc. of ICASSP, pp.425-428, Mar., 2010.
Kazai, K., Konishi, K., Bennett, P. J., Sekuler, A. B., Yagi, A., Katayose, H., and Nagai, M. Structural encoding of schematic face: an event-relatedbrain potential investigation. The 15th Annual Meeting of the Organization for Human Brain Mapping, San Francisco, U.S.A. 2009.06.22.
Hosokawa, T., Kazai, K. & Katayose, H. The validity of the fNIRS recording in the prefrontal cortex for lie detection. The 15th Annual Meeting of the Organization for Human Brain Mapping, San Francisco, U.S.A. 2009.06.23.
Toshie Matsui, Koji Kazai, Haruhiro Katayose and Minoru Tsuzaki (2009). "Activation of musicians' brains during phrase segmentation of actual music: An fMRI study"," Proc. of Human Brain Mapping: California, USA, June 18-23.
Hideki Kawahara, Masanori Morise, Toru Takahashi, Hideki Banno, Ryuichi Nisimura and Toshio Irino: Vocoder-based morphing tool demonstrations for flexible voice manipulations, AES 14th Regional Convention, Tokyo, (2009.5.13, Tokyo).
Tomoyasu Nakano and Masataka Goto: VocaListener: A Singing-to-Singing Synthesis System Based on Iterative Parameter Estimation, Proceedings of the 6th Sound and Music Computing Conference (SMC 2009), pp.343-348, July 2009.
Takeshi Saitou and Masataka Goto: Acoustic and Perceptual Effects of Vocal Training in Amateur Male Singing, Proceedings of the 10th Annual Conference of the International Speech Communication Association (Interspeech 2009), pp.832-835, September 2009.
H. Kawahara, M. Morise, T. Takahashi, H. Banno, R. Nisimura and T. Irino, ``Observation of empirical cumulative distribution of vowel spectral distances and its application to vowel based voice conversion,'' Proc. Interspeech2009, pp.2647-2650, 2009. (2009.9.10, Brighton UK)
H. Kawahara, R. Nisimura, T. Irino, M. Morise, T. Takahashi and H. Banno: High-quality and light-weight voice transformation enabling extrapolation without perceptual and objective breakdown, ICASSP2010, Dallas USA (2009.3)
Yu Kitano, Hirokazu Kameoka, Yosuke Izumi, Nobutaka Ono, Shigeki Sagayama, “A Sparse Component Model of Source Signals and Its Application to Blind Source Separation,” Proc. of ICASSP, pp.4122-4125, Mar., 2010.

Inveted Talks

Takeshi Saitou, Masataka Goto, Masashi Unoki, and Masato Akagi: Invited talk "Speech-to-Singing Synthesis System: Vocal Conversion from Speaking Voices to Singing Voices by Controlling Acoustic Features Unique to Singing Voices" in the special session "International Symposium on Speech and language Processing Technology I" of the 10th National Conference on Man-Machine Speech Communication (NCMMSC 2009), Lanzhou, China, August 15, 2009.
Hiroshi G. Okuno, Kazuhiro Nakadai, Hyun-Don Kim: Robot Auditon: Missing Feature Theory Approach and Active Audition (Invited talk), Proceeding of the 14th International Symposium of Robotics Research (ISRR 2009), August 31 - September 3, 2009, Lucerne, Switzerland.
Takuma Ohtsuka, Kazumasa Murata, Toru Takahashi, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno: Incremental Polyphonic Audio to Score Alignment using Beat Tracking for Singer Robots (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2009), pp.2289-2296, IEEE, RSJ, St. Louis, 12-14 (13) Oct. 2009.
Takeshi Mizumoto, Hiroshi Tsujino, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno: Thereminist Robot: Development of a Robot Theremin Player with Feedforward and Feedback Arm Control based on a Theremin's Pitch Model (Invited paper), Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2009), pp.2297-2302, IEEE, RSJ, St. Louis, 12-14 (13) Oct. 2009.
Masataka Goto: Keynote speech " Augmented Music-Understanding Interfaces: Toward Music Listening in the Future" in the International Workshop on Advances in Music Information Research 2009 (AdMIRe 2009) of the IEEE International Symposium on Multimedia 2009 (ISM 2009), San Diego, California, USA, December 16, 2009.
Hideki Kawahara: Speech morphing based on biologically relevant signal representations, MAVEBA09, Firenze Italy, 2009.12.
Masataka Goto, Takeshi Saitou, Tomoyasu Nakano, and Hiromasa Fujihara: Singing Information Processing Based on Singing Voice Modeling, Proceedings of the 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2010), pp.5506-5509, March 2010. (Invited Paper)