著者
Hideki Kakeya Ken Okada Hayato Takahashi
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.6, no.3, pp.237-246, 2018 (Released:2018-07-01)
参考文献数
25
被引用文献数
14

This paper presents a full-HD autostereoscopic display with a wide viewing zone based on time-division multiplexing slanted parallax barrier with subpixel-based slit control. A slanted directional diffuser placed between the barrier panel and the image panel suppresses moir_ by mixing light from RGB subpixels. By constituting the parallax barrier along with this slanted diffusion line, control of the barrier by subpixel unit is enabled. By introducing subpixel-based phase shift and slit width control to time-division quadruplexing parallax barrier, viewing zone free from crosstalk is enlarged notably. Theoretical viewing zone is calculated and compared with the experimental results using a prototype hardware. The overall crosstalk of the prototype is also measured to confirm the effectiveness of the proposed method.
著者
Garimagai Borjigin Hideki Kakeya
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.9, no.1, pp.80-85, 2021 (Released:2021-01-01)
参考文献数
22
被引用文献数
1 5

In this paper, we propose autostereoscopic displays with novel directional backlight designs. A high level of stereoscopic crosstalk is the main problem to be solved in the conventional systems. To reduce crosstalk, we propose a directional backlight system that suppresses the effect of field curvature only with a single layer of curved lens array. It is confirmed that the crosstalk level is reduced notably by the proposed methods. The uniformity of backlight intensity is also increased by using a lens array composed of trapezoid elemental lenses in place of rectangle elemental lenses.
著者
Yusuke Matsui Yusuke Uchida Hervé Jégou Shin'ichi Satoh
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.6, no.1, pp.2-10, 2018 (Released:2018-01-01)
参考文献数
53
被引用文献数
1 21

Product Quantization (PQ) search and its derivatives are popular and successful methods for large-scale approximated nearest neighbor search. In this paper, we review the fundamental algorithm of this class of algorithms and provide executable sample codes. We then provide a comprehensive survey of the recent PQ-based methods.
著者
Takefumi Hiraki Shogo Fukushima Hiroshi Watase Takeshi Naemura
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.7, no.4, pp.160-168, 2019 (Released:2019-10-01)
参考文献数
18
被引用文献数
4

We previously studied methods leveraging pixel-level visible light communication (PVLC) that embeds imperceptible information for human eyes in each pixel of an image. The PC computation load and amount of data transferred between the PC and projector in previous PVLC systems were excessive because the PC executed both the video and data encoding processes. As a result, it was impossible to achieve both high-dynamic-range images and dynamic updates of the images and data. In this paper, we propose a dynamic PVLC system that offers high video quality and interactively updates the PVLC information through hardware encoding processing. Our system can project a 24-bit gradation color PVLC video that contains 64-bit data at 120 fps by synchronously controlling the ON/OFF states of the DMD and LED light sources at the given performance limit of the projector.
著者
Bin Yang Hideki Kakeya
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.9, no.2, pp.136-142, 2021 (Released:2021-04-01)
参考文献数
15
被引用文献数
1

We propose an autostereoscopic display allowing two observers with adaptive fractional time-division multiplexing parallax barrier. Fractional time-division suppresses perceived flickers when the order of slit position is properly set. To make sure that both of the observers are located in the proper viewing zones to enable stereoscopy simultaneously, the number of time-division multiplexing is switched in accordance with the distance between them. The viewing zone without crosstalk for the second viewer is evaluated theoretically.
著者
Tomoyo Kikuchi Yuchi Yahagi Shogo Fukushima Saki Sakaguchi Takeshi Naemura
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.11, no.2, pp.75-87, 2023 (Released:2023-04-01)
参考文献数
31

Mid-air images in a three-dimensional space beyond a screen enable users to observe virtual content without wearing any devices. When using a symmetrical mirror structure with dual slit-mirror arrays, the system enables the display of a large mid-air image by integrating multiple imaging paths. However, in the mirror design used in previous research, the luminance was discontinuous. In this study, we propose a novel tabletop system in which a tall mid-air image with continuous luminance is superimposed onto physical objects. Our proposed system, called “AIR-range”, presents mid-air images that appear seamlessly from the table surface to mid-air. By theorizing the relationship between the parameters of optical systems and luminance of mid-air images, we optimized the optical systems to minimize the difference in luminance between the imaging paths. The results of the comparison with the previous method showed an improvement in luminance continuity.
著者
Koichi Ito Takafumi Aoki
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.6, no.1, pp.64-80, 2018 (Released:2018-01-01)
参考文献数
82
被引用文献数
9

This paper presents recent advances in biometric recognition, where we focus on face, fingerprint and iris recognition, which are major research topics on biometric recognition. We summarize the research trend of face, fingerprint and iris recognition over the past decade. This paper also presents our activities of biometric recognition. Our approach employs the phase information obtained by Discrete Fourier Transform (DFT) of images. The phase information preserves the inherent features of the image, and its correlation function, called phase correlation or Phase-Only Correlation (POC), gives us both the good similarity measure for biometric recognition and the translational displacement for image registration. Our approach of using phase information has been successfully applied to fingerprint, face, iris, palmprint, finger knuckle and dental recognition. Among them, we present some interesting results of palmprint recognition, finger knuckle recognition and dental recognition.
著者
田中 孝昌 外山 史 宮道 壽一 東海林 健二
出版者
The Institute of Image Information and Television Engineers
雑誌
映像情報メディア学会誌 : 映像情報メディア = The journal of the Institute of Image Information and Television Engineers (ISSN:13426907)
巻号頁・発行日
vol.64, no.12, pp.1933-1939, 2010-12-01
被引用文献数
7

Today, the demand is increasing for comic contents on cellular phones and speech software for the visually impaired. When speech software reads aloud a comic character's speech, it is useful for both the visually impaired and unimpaired to have the character's voice injected with his/her feeling, which is inferred from types of speech balloons. As a result, comic contents come to life. In this research, a method has been developed to detect speech balloons on comic pages and then classify them into four types. In this method, speech balloon candidates are extracted based on speech text information detected by AdaBoost, and then speech balloons are selected and classified using SVM. Experimental results show that the proposed method successfully detected and classified 86 percent of 2844 speech balloons.
著者
Hironari Takehara Ze Wang Honghao Tang Noriaki Kishida Yusuke Horiki Motoshi Sobue Makito Haruta Hiroyuki Tashiro Kiyotaka Sasagawa Jun Ohta
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.10, no.2, pp.59-68, 2022 (Released:2022-04-01)
参考文献数
38
被引用文献数
1

This study proposes to adapt the fundus camera for use as a personal healthcare tool. The proposed system uses near-infrared light to avoid blinding the subject and three-wavelength near-infrared imaging to acquire colorized fundus images. First, the optical system with the fundus camera was tested using a three-plate near-infrared snapshot camera. Subsequently, image processing and denoising techniques, including tracking and image integration, were applied to reduce the blur caused by biological scattering. Furthermore, a singlechip three-wavelength near-infrared-compatible image sensor is required for device miniaturization, for which a dielectric multilayer Fabry-Perot bandpass filter was adopted as its transmission wavelength can be easily controlled. In this study, the optical design of the dielectric multilayer structure, the fabrication process of the mosaic filter, and the chip mounting technology are investigated. The demosaicing process and color space conversion corresponding to the spectral response characteristics of the fabricated image sensor are also discussed.
著者
岡 芳樹 山本 正信
出版者
The Institute of Image Information and Television Engineers
雑誌
映像情報メディア学会誌 (ISSN:13426907)
巻号頁・発行日
vol.68, no.2, pp.J72-J77, 2014

At a theme park or entertainment show, actors wearing a cartoon-character costumes entertain guests. Unfortunately, existing cartoon-character costumes have heads with a fixed facial expression. We propose a novel cartoon-character costume where the head is equiped with a web camera inside and display panel outside. When an actor wears the head, the web camera captures the face of the actor. Based on the pattern classification technique, the facial expression is classified into one of five categories of emotion: anger, joy, sadness, surprise and neutrality. For each category, a corresponding facial image is chosen from a bank of facial images of cartoon characters. The chosen image is depicted on the display panel as the face in real time. The new costume head enables the actor to communicate with audiences interactively and play the other characters immediately by changing the facial image bank.
著者
Yiwei Zhang Xueting Wang Yoshiaki Sakai Toshihiko Yamasaki
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.9, no.4, pp.262-275, 2021 (Released:2021-10-01)
参考文献数
38
被引用文献数
1

Exploring brands that customers are likely to purchase jointly has a profound effect on marketing. This study proposes a new way to measure, or estimate the similarity between brands using social media. The proposed algorithm analyzes the daily photos and hashtags posted by each brand's followers. By clustering them and converting them into histogram-based features, we can calculate the similarity between brands. We evaluate our proposed algorithm by comparing it with the purchase logs of point/credit card companies, and answers to the questionnaires. The results show that purchase logs can predict the co-purchase behaviors in the questionnaires very well, but cannot predict customers' potential interest or willingness to buy products from new brands. On the other hand, our method can predict the users’ interest in brands with a correlation coefficient of over 0.53, which is high considering that such interest in brands is highly subjective and individual dependent.
著者
Hitoshi Nishimura Kazuyuki Tasaka Yasutomo Kawanishi Hiroshi Murase
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.8, no.4, pp.269-279, 2020 (Released:2020-10-01)
参考文献数
39
被引用文献数
3

In this paper, we propose a multiple human tracking method with alternately updating trajectories and mult iframe action features (MHT-MAF). Even though occlusion or motion blur occurs due to the sudden movement of the drone, ID switches are prevented by the stable MAF. In the experiments, we verified the effectiveness of the proposed method using the Okutama-Action dataset. Our code is available online (https://github.com/hitottiez/mht-paf).
著者
Satoshi Abe Takefumi Hiraki Shogo Fukushima Takeshi Naemura
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.8, no.3, pp.170-185, 2020 (Released:2020-07-01)
参考文献数
40
被引用文献数
2

Communication between screens and cameras has attracted attention as a ubiquitous information source, motivated by the widespread use of smartphones and the increase of public information screens. The method that encodes data into visible patterns impairs the user's visual experience. Previously, embedding matrix barcodes into images on displays by utilizing imperceptible color vibration was proposed. In this approach, the visual experience is maintained considering that barcodes are imperceptible, and it can be implemented on almost any display and camera. Herein, we describe a sophisticated modulation protocol and restoration procedure whereby device characteristics such as the display's gamma feature and the smartphone's rolling shutter are taken into consideration. Extensive experiments reveal the parameters for the modulation and that this system works under practical situations. In addition, scenarios of potential practical applications and a user study examining imperceptibility of barcodes and usability of the system are presented to illustrate the technological capabilities.
著者
Tomoki Haruyama Sho Takahashi Takahiro Ogawa Miki Haseyama
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.8, no.2, pp.89-99, 2020 (Released:2020-04-01)
参考文献数
41

The details of the matches of soccer can be estimated from visual and audio sequences, and they correspond to the occurrence of important scenes. Therefore, the use of these sequences is suitable for important scene detection. In this paper, a new multimodal method for important scene detection from visual and audio sequences in far-view soccer videos based on a single deep neural architecture is presented. A unique point of our method is that multiple classifiers can be realized by a single deep neural architecture that includes a Convolutional Neural Network-based feature extractor and a Support Vector Machine-based classifier. This approach provides a solution to the problem of not being able to simultaneously optimize different multiple deep neural architectures from a small amount of training data. Then we monitor confidence measures output from this architecture for the multimodal data and enable their integration to obtain the final classification result.
著者
安田 英史
出版者
The Institute of Image Information and Television Engineers
雑誌
映像情報メディア学会誌 (ISSN:13426907)
巻号頁・発行日
vol.68, no.2, pp.J61-J65, 2014

テレビ番組で頻繁に使用されるフリップは,放送の必須のアイテムである.これまでは紙製で使い捨てのフリップが主体であったが,昨今のいわゆるエコの考え方には逆行している.そこで筆者らは,タッチパネル機能付きの大型液晶モニタを用いた,生放送に耐えうる電子フリップシステムの研究・開発を行った.電子フリップのアプリケーションは,汎用的なツールを用いて作成し,システム構成も簡素化した.利用者の運用面の最大限の協力も相まって,コストパフォーマンスの良いシステムを作り上げることができた.
著者
関根 雅人 小川 克彦
出版者
The Institute of Image Information and Television Engineers
雑誌
映像情報メディア学会誌 (ISSN:13426907)
巻号頁・発行日
vol.67, no.12, pp.J463-J471, 2013

Motion graphics is a form of visual expression characterized by non-narrative, non-figurative based visuals that change over time. Due to the expansion of its application areas, a consideration of the affective quality of motion graphics is growing more important. This paper proposes an arousal estimation method that uses optical flow analysis as an affective quality assessment method for motion graphics. The primary objective is to verify two indexes: the total flow amount and the average magnitude of motion vectors. According to correlation analyses between each index and the arousal factor scores of video stimulus derived from an impression test with human participants, the correlation between the average magnitude of motion vectors and the arousal factor scores is significant. An analysis of the distribution of displacements showed that about three pixels per 33.3 msec is the border value: a higher ratio of movement slower than that border reduces the arousal level, and faster movement raises the level.
著者
藤井 勝之 伊藤 公一 田島 茂
出版者
The Institute of Image Information and Television Engineers
雑誌
映像情報メディア学会大会講演予稿集
巻号頁・発行日
pp.140, 2003 (Released:2004-03-26)

Studies of wearable computers have attracted public attention in these days. And one of the area of interest is the communication system adopted in those wearable computers. As anexample, wearable devices which use the human body as a transmission channel, have been developed. This communication system uses near field region of the electromagnetic wave generated by the device which is eventually coupled to human body by electrodes. However, little is known about the transmission mechanism of such devices in the physical layer. In this paper, we proposed calculation model of the transmitter attached to the arm of whole body in radio anechoic chamber using the FDTD method. From this model, we estimated the electric field distribution around the human body.