Authors
Ryosuke O. Tachibana, Yasunari Sasaki, Hiroshi Riquimaroux
Publisher
一般社団法人 日本音響学会
Journal
Acoustical Science and Technology (ISSN:13463969)
Volume, issue, pages, and publication date
vol.34, no.4, pp.263-270, 2013-07-01 (Released:2013-07-01)
Number of references
29
Number of citations
1 11

In this study, we investigated how temporal envelopes contribute to the recognition of isolated syllables, words, and sentences in noise-vocoded speech, in comparison with the influence of spectral resolution. The spectral and temporal resolutions of the speech materials were systematically manipulated by a noise-vocoding technique. Japanese monomoraic syllables, meaningful and meaningless words, and sentences were used as test speech materials. The original speech sound was spectrally separated by a filter bank, and each spectral band was replaced with band noise that preserved its original temporal envelope. Spectral resolution was controlled by setting the number of spectral bands to 4, 8, or 16. Temporal resolution was altered by smoothing the amplitude envelope using a low-pass filter with a cut-off frequency of 4, 8, or 16 Hz. The results demonstrated an increased dependence on temporal resolution, relative to spectral resolution, for the recognition of speech materials at higher structural levels (i.e., monosyllables, words, and sentences). This increased temporal dependence suggests that the temporal envelope in noise-vocoded sentences contained a greater amount of the suprasegmental information, such as coarticulation, prosody, and rhythm, in the original speech sound.
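As a rough illustration of the temporal-resolution manipulation described in this abstract, the sketch below rectifies one band signal and smooths its envelope at the three cut-off frequencies. The one-pole low-pass filter, the function names, and the amplitude-modulated test signal are all assumptions made for brevity; the abstract does not state which filter design or stimuli parameters the authors used.

```python
import math


def smooth_envelope(band_signal, sample_rate, cutoff_hz):
    """Full-wave rectify one band and smooth it with a one-pole low-pass filter."""
    # One-pole smoothing coefficient; the filter type is an assumption,
    # chosen for brevity rather than taken from the paper.
    alpha = 1.0 - math.exp(-2.0 * math.pi * cutoff_hz / sample_rate)
    envelope = []
    state = 0.0
    for sample in band_signal:
        state += alpha * (abs(sample) - state)
        envelope.append(state)
    return envelope


def modulation_depth(envelope):
    """Return (max - min) / (max + min) over the second half of the envelope."""
    tail = envelope[len(envelope) // 2:]
    high, low = max(tail), min(tail)
    return (high - low) / (high + low)


def demo_signal(sample_rate=8000, duration_s=1.0, carrier_hz=1000.0, mod_hz=10.0):
    """Hypothetical test input: a carrier with sinusoidal amplitude modulation."""
    n = int(sample_rate * duration_s)
    return [
        (1.0 + 0.8 * math.sin(2.0 * math.pi * mod_hz * t / sample_rate))
        * math.sin(2.0 * math.pi * carrier_hz * t / sample_rate)
        for t in range(n)
    ]


if __name__ == "__main__":
    signal = demo_signal()
    for cutoff in (4, 8, 16):  # cut-off frequencies named in the abstract
        depth = modulation_depth(smooth_envelope(signal, 8000, cutoff))
        print(f"cutoff {cutoff} Hz: modulation depth {depth:.3f}")
```

With a 10 Hz modulation, a lower cut-off flattens the envelope more, which is the sense in which the cut-off frequency sets the temporal resolution. A full noise vocoder would additionally need the band-pass filter bank and the band-noise carriers, which are omitted here.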
Authors
Gerard B. Remijn, Emi Hasuo, Haruna Fujihira, Satoshi Morimoto
Publisher
一般社団法人 日本音響学会
Journal
Acoustical Science and Technology (ISSN:13463969)
Volume, issue, pages, and publication date
vol.35, no.5, pp.229-242, 2014-05-01 (Released:2014-09-01)
Number of references
109
Number of citations
11

In 1939, Pauline Davis reported the first study on event-related potentials (ERPs) performed on awake humans. ERPs are time-locked brain potentials that occur in response to cognitive, motor or perceptual events. The events used by Davis were sounds, and in the decades that followed her landmark study ERP research significantly contributed to the knowledge of auditory perception and neurophysiology we have today. ERPs are very well suited to study neural responses to sound stimuli, since the researcher can monitor the brain's registration of sound edges and spectral changes in sound on a millisecond-by-millisecond basis. In this overview we will introduce basic concepts of auditory ERP research. The overview includes descriptions of typical ERP components, experimental paradigms, sound stimuli, research methodology, and ways to analyze data.
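The time-locking idea in this abstract can be sketched as epoch averaging: segments of the recording around each event onset are baseline-corrected and averaged, so that activity unrelated to the event tends to cancel. The function name, the sample-index arguments, and the pre-stimulus baseline correction are illustrative assumptions, not a description of the procedures covered in the overview.

```python
def average_erp(recording, event_onsets, pre, post):
    """Average epochs time-locked to event onsets.

    recording: list of voltage samples from one channel.
    event_onsets: sample indices at which the events occurred.
    pre, post: number of samples kept before and after each onset (pre > 0).
    """
    epochs = []
    for onset in event_onsets:
        start, stop = onset - pre, onset + post
        if start < 0 or stop > len(recording):
            continue  # skip events without a complete epoch
        epoch = recording[start:stop]
        # Subtract the mean of the pre-stimulus interval (baseline correction).
        baseline = sum(epoch[:pre]) / pre
        epochs.append([value - baseline for value in epoch])
    if not epochs:
        raise ValueError("no complete epochs found")
    count = len(epochs)
    # Average across epochs, sample by sample.
    return [sum(column) / count for column in zip(*epochs)]
```

For example, `average_erp(samples, onsets, pre=100, post=500)` would return a 600-sample waveform whose first 100 samples precede the event.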
Authors
Greg Short, Keikichi Hirose, Nobuaki Minematsu
Publisher
一般社団法人 日本音響学会
Journal
Acoustical Science and Technology (ISSN:13463969)
Volume, issue, pages, and publication date
vol.35, no.2, pp.73-85, 2014-02-01 (Released:2014-03-01)
Number of references
34

For Japanese speech processing, the ability to automatically distinguish geminate from singleton consonants has many benefits. Standard recognition methods use hidden Markov models (HMMs). However, HMMs are not good at differentiating between items that are distinguished primarily by temporal differences rather than spectral differences. In addition, the perception of gemination depends on the length of the sounds surrounding the consonant. We therefore propose a method that automatically distinguishes geminates from singletons and takes these factors into account. To do this, it is necessary to determine which surrounding sounds serve as cues and what the mechanism of human recognition is. To this end, we conduct perceptual experiments to examine the relationship between surrounding sounds and primary cues. Then, using these results, we design a method that can automatically recognize gemination. We test this method on two datasets, including a speaking-rate database. The method substantially outperforms the HMM-based method, outperforms overall the case in which only the primary cue is used for recognition, and is more robust to changes in speaking rate.
Authors
Yasuhito Kawai, Masahiro Toyoda
Publisher
一般社団法人 日本音響学会
Journal
Acoustical Science and Technology (ISSN:13463969)
Volume, issue, pages, and publication date
vol.35, no.1, pp.28-34, 2014-01-01 (Released:2014-01-01)
Number of references
13
Number of citations
2 4

Noise barriers are often very tall alongside highways with heavy traffic. Although these high barriers ensure the desired amount of noise attenuation, they are expensive to install and have a negative effect on the landscape. Consequently, many types of edge-modified noise barriers have been proposed to reduce the necessary height. Herein an alternative noise barrier based on the "edge-effect" suppression technique is proposed, and the sound insulation performance is investigated both theoretically and experimentally. Numerical examples indicate that the diffracted sound is greatly attenuated by suppressing the particle velocity in the region with a large velocity amplitude using a thin absorbing material such as cloth with a gradational distribution in impedance. The experimental and theoretical results of insertion loss are in good agreement, validating the theoretical consideration and effectiveness of the cloth installed at the top of the barrier.
Authors
Takuya Oshima, Yasuhiro Hiraguri, Masashi Imano
Publisher
一般社団法人 日本音響学会
Journal
Acoustical Science and Technology (ISSN:13463969)
Volume, issue, pages, and publication date
vol.35, no.2, pp.108-118, 2014-02-01 (Released:2014-03-01)
Number of references
27
Number of citations
1 3

With recent progress in computer performance and simulation techniques, it is becoming feasible to apply full three-dimensional wave-based numerical simulation techniques to large-scale problems of real-life outdoor sound propagation. In the present paper, a technique is presented for reconstructing real-life urban geometries, with full reproduction of the roof shapes, and ground profiles from digital geographic information. A technique for generating the uniform rectilinear grid used in finite-difference time-domain simulations is also presented. The geographic datasets used for the reconstruction are a digital surface model and a two-dimensional building outline map. For comparison, another geometry with flat building roofs, which is the type of geometry used in former noise-mapping studies using empirical models, is created. Comparison of the results of finite-difference time-domain acoustic simulations performed over the two geometries shows sound pressure level differences above and behind buildings. The maximum level difference of 10 dB in magnitude indicates the necessity of proper reconstruction of the roof shapes in real-life urban acoustic simulations.
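To make the two geometries in this abstract more concrete, the sketch below fills a uniform rectilinear grid from a height map and builds a flat-roof variant from a building footprint map. It is only a toy illustration: the cell-centre test, the use of the mean height for the flat roofs, and all function and parameter names are assumptions, and the paper's actual reconstruction procedure is not given in the abstract.

```python
def flatten_roofs(height_map, building_ids):
    """Replace heights inside each footprint with that building's mean height.

    height_map: 2-D list of surface heights (a stand-in for a digital surface model).
    building_ids: 2-D list of the same shape; None marks ground cells
    (a stand-in for a rasterized building outline map).
    """
    sums, counts = {}, {}
    for height_row, id_row in zip(height_map, building_ids):
        for height, building in zip(height_row, id_row):
            if building is not None:
                sums[building] = sums.get(building, 0.0) + height
                counts[building] = counts.get(building, 0) + 1
    return [
        [height if building is None else sums[building] / counts[building]
         for height, building in zip(height_row, id_row)]
        for height_row, id_row in zip(height_map, building_ids)
    ]


def voxelize(height_map, dz, nz):
    """Mark cell (i, j, k) as solid when its centre lies below the surface."""
    return [
        [[(k + 0.5) * dz < height for k in range(nz)] for height in row]
        for row in height_map
    ]
```

Calling `voxelize` on the original height map and on the output of `flatten_roofs` gives two occupancy grids that differ only in the roof region, which is where the abstract reports the level differences.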
Authors
伊藤 憲三, 水島 昌英, 北脇 信彦
Publisher
一般社団法人 日本音響学会
Journal
日本音響学会誌 (ISSN:03694232)
Volume, issue, pages, and publication date
vol.61, no.8, pp.431-440, 2005
Number of references
28
Number of citations
1

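We proposed a method that efficiently suppresses stationary noise by classifying the input signal as either speech or other signals (non-speech). Speech and non-speech were discriminated using features that represent the periodicity of the input signal, spectral features of the signal, power, and other measures. Noise suppression combined spectral subtraction (SS) with loss control (LC). In SS, the noise was subtracted with frequency weighting, and LC applied loss control to the residual noise left by SS so that the residual noise in silent intervals was removed completely. Evaluation experiments with various types of noise showed that the method works well under noise conditions with a signal-to-noise ratio of about 10 dB or higher. We also showed that combining the method with a multi-microphone sound pickup system further improves performance. Furthermore, a digit intelligibility test with hearing-impaired listeners showed that the proposed noise suppression is highly effective for the hearing of listeners with hearing loss.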
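The combination of spectral subtraction (SS) and loss control (LC) in this abstract can be sketched on per-frame magnitude spectra as follows. This is a minimal sketch under stated assumptions: the speech/non-speech decision is taken as a given flag per frame, the noise estimate and frequency weights are supplied by the caller, and the function names and the `loss_gain` parameter are hypothetical. The paper's own classifier and weighting rule are not specified in the abstract.

```python
def spectral_subtraction(noisy_mag, noise_mag, weights):
    """Subtract a frequency-weighted noise estimate from one frame.

    All arguments are lists with one value per frequency bin.
    Negative results are clipped to zero.
    """
    return [
        max(x - w * n, 0.0)
        for x, n, w in zip(noisy_mag, noise_mag, weights)
    ]


def loss_control(frame_mag, is_speech, loss_gain=0.0):
    """Attenuate a frame classified as non-speech; pass speech frames unchanged."""
    if is_speech:
        return list(frame_mag)
    # loss_gain = 0.0 removes the residual noise entirely (assumed default).
    return [loss_gain * value for value in frame_mag]


def suppress(frames, speech_flags, noise_mag, weights, loss_gain=0.0):
    """Apply SS to every frame, then LC to the frames flagged as non-speech."""
    return [
        loss_control(spectral_subtraction(frame, noise_mag, weights), flag, loss_gain)
        for frame, flag in zip(frames, speech_flags)
    ]
```

In a complete system the magnitude spectra would come from a short-time Fourier transform and the output would be resynthesized with the noisy phase; those steps are left out here to keep the sketch focused on the two suppression stages.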
Authors
Naofumi Aoki, Yota Morimoto
Publisher
一般社団法人 日本音響学会
Journal
Acoustical Science and Technology (ISSN:13463969)
Volume, issue, pages, and publication date
vol.34, no.5, pp.342-343, 2013-09-01 (Released:2013-09-01)
Number of references
5
Number of citations
2
Authors
Masayuki Takada, Yoko Fukuda, Shin-ichiro Iwamiya
Publisher
一般社団法人 日本音響学会
Journal
Acoustical Science and Technology (ISSN:13463969)
Volume, issue, pages, and publication date
vol.34, no.3, pp.187-196, 2013-03-01 (Released:2013-05-01)
Number of references
20
Number of citations
1 4

The Japanese Road Traffic Law states that a vehicle horn should be used only in an emergency or in dangerous locations where a horn sign has been installed. However, drivers frequently use their vehicle horn as a signal when passing each other. Furthermore, pedestrians and cyclists are also honked at by drivers. There may be a negative effect of vehicle horn use on people outside the vehicle. Therefore, a questionnaire survey was carried out to ascertain the current circumstances of vehicle horn use. Respondents were asked about the latest or last-remembered instance of horn use in various situations, both when they were driving and when they were pedestrians. With regard to experiences of being honked at by another driver, the questionnaire included questions concerning place, traffic volume, the aim of horn use, the timing pattern of horn use, and the respondent's psychological reaction when honked at by another driver. It was found that drivers briefly honked their horn to express gratitude, or to gain another's attention, in various places. Long honks such as one long honk or two long honks mostly evoked negative psychological reactions in drivers, such as a feeling that the sound of the horn was startling, noisy, or irritating. Furthermore, pedestrians frequently had negative psychological reactions to horns used to gain their attention or to alert them to danger. More than 40% of nondrivers were displeased with the volume and sound quality of horns.
Authors
Madoka Takimoto, Takanori Nishino, Katunobu Itou, Kazuya Takeda
Publisher
一般社団法人 日本音響学会
Journal
Acoustical Science and Technology (ISSN:13463969)
Volume, issue, pages, and publication date
vol.28, no.5, pp.335-342, 2007 (Released:2007-09-01)
Number of references
13
Number of citations
1 5

In this paper, we examine how covering one or both external ears affects sound localization on the horizontal plane. In our experiments, we covered subjects’ pinnae and external auditory canals with headphones, earphones, and earplugs, and conducted sound localization tests. Stimuli were presented from 12 different directions, and 12 subjects participated in the sound localization tests. The results indicate that covering one or both ears decreased their sound localization performance. Front-back confusion rates increased, particularly when covering both outer ears with open-air headphones or covering one ear with an intraconcha-type earphone or an earplug. Furthermore, incorrect answer rates were high when the sound source and the occluded ear that had an intraconcha-type earphone or an earplug were on the same side. We consider that the factors that cause poor performance can be clarified by comparing these results with characteristics of head-related transfer function.
Authors
Kanako Ueno, Hideki Tachibana
Publisher
一般社団法人 日本音響学会
Journal
Acoustical Science and Technology (ISSN:13463969)
Volume, issue, pages, and publication date
vol.26, no.2, pp.156-161, 2005 (Released:2005-03-01)
Number of references
7
Number of citations
3 5 18

To investigate the effect of a hall's response on musicians, we have conducted various experimental studies to date. To advance this research, one of the most important tasks is to understand the musicians’ perception of the acoustic effects of halls. Musicians generally perceive the acoustic properties of a concert hall while continuing to perform and subconsciously adjusting their playing technique. This kind of interactive relationship between musicians and the acoustic environment is important when considering the acoustic value of concert halls for musicians. In this study, musicians’ awareness of concert halls was investigated through an interview survey, and the cognitive psychological phenomena of musicians were interpreted by applying the “tacit knowing” theory. The process of extracting the musicians’ perception in experimental studies on concert hall acoustics is then discussed.
Authors
Tatsuya Yoshizawa, Gerard Bastiaan Remijn, Takumi Kitamura
Publisher
一般社団法人 日本音響学会
Journal
Acoustical Science and Technology (ISSN:13463969)
Volume, issue, pages, and publication date
vol.33, no.3, pp.147-153, 2012-05-01 (Released:2012-04-29)
Number of references
36

Dichotic listening studies have shown that information relevant to listeners, such as their own name, can be recognized even when presented to the unattended ear. Here, we used a dichotic listening paradigm to explore whether Japanese listeners could identify their name in the unattended ear even when sensory information was incomplete. The results showed that Japanese listeners with family names of 3, 4, or 5 morae — a speech unit equivalent to a syllable in English — recognized their name in about 20–60% of the trials even when the first or the last mora of the name was omitted. The data further showed a name-final effect under the 4- and 5-morae conditions: name recognition significantly decreased when the last mora of the listener’s name was omitted as compared with the omission of the first mora. A possible explanation for these results is that self-relevant information, even when incomplete, automatically draws attention to the supposedly unattended ear and that the listener’s recognition of the information is more robust when its end part is presented.