著者
Hirohito Shibata Kengo Omura
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.6, no.4, pp.255-261, 2018 (Released:2018-10-01)
参考文献数
32
被引用文献数
6 6

Handwriting is preferred when people take notes or annotate during listening to lectures or reading documents. This paper proves the effects of handwriting experimentally. Two experiments using a dual task method revealed that the cognitive load of handwriting was smaller than that of typing and typing interfered with memorization more than handwriting. Moreover, this tendency was also observed for people who can type fast with touch typing. This indicates that handwriting has a strong advantage in keeping information without interfering with other cognitive activities regardless of people's typing skill.
著者
Masayoshi Higuchi Yukio Fujii Yoji Hisamatsu
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.9, no.4, pp.228-233, 2021 (Released:2021-10-01)
参考文献数
21

Metallo-supramolecular polymers (MSPs) are a novel type of electrochromic (EC) materials. Ru(II)-based MSP (polyRu) composed of Ru(II) and bis(terpyridyl)benzene showed reversible color changes between orange and pale green. The orange color was caused by the metal-to-ligand charge transfer (MLCT) absorption in polyRu and disappeared by the electrochemical oxidation of Ru(II) to Ru(III). The pale green was returned to the original orange by the electrochemical reduction of Ru(III) to Ru(II). EC devices with polyRu were fabricated by the combination of an electrolyte solution, counter material, and two ITO glasses. The character images were displayed on the EC devices using insulating films. The insulating films prevented the electron transfer between the ITO glass and the polyRu layer and made the image stand out in the device. The fabricated EC display devices were presented at a science museum of Japan as experience-based exhibits.
著者
Junichi Shibasaki Kenichi Aoshima Shintaro Aso Nobuhiko Funabashi Takahiro Ishinabe Yosei Shibata Hideo Fujikake Kenji Machida
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.9, no.4, pp.240-246, 2021 (Released:2021-10-01)
参考文献数
22

We compare the diffraction characteristics of ferroelectric (FLC) and nematic liquid crystal (NLC) devices with one-dimensional stripe patterns of 1-10 µm pixel pitches. The polarizing micrographs show pixel boundaries of black/white pixels blur as the pixel pitch becomes smaller. The blur of NLC is more remarkable than that of FLC. The first-order diffraction efficiency of NLC remains constant for the pixel pitch of 4-10 µm and sharply decreases for the pixel pitch of < 2 µm. By contrast, the FLC efficiency decreases with the pixel pitch decrease from 10 to 4 µm and remains constant for the pixel pitch of < 3 µm. The FLC efficiency (5.5%) is four times larger than that of NLC (1.4%) with a 1 µm pixel pitch. The Fourier transform calculation shows the efficiency degradation of FLC is caused by the blur at the pixel boundary, whereas that of NLC caused by the blur and contrast deterioration.
著者
Keiichiro Kagawa
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.9, no.2, pp.114-121, 2021 (Released:2021-04-01)
参考文献数
31
被引用文献数
4

Multi-tap CMOS pixels that are composed of a single photodiode, multiple sets of a charge transfer gate and storage diode, and a draining gate can implement functional imaging. In this paper, imaging systems based on the multi-tap CMOS pixel are categorized into those with synchronized active illuminations and those using coded exposure. Applications for quantitative wide-field imaging based on spatial frequency domain imaging (SFDI) using structured light projection and multi-exposure laser speckle contrast blood flow imaging (MELSCI) utilizing multiple exposure times are shown. The multi-tap CMOS pixel provides additional benefits like suppression of ambient light and motion artifact with SFDI and efficient sampling at a video rate with MELSCI.
著者
Kenta Masui Genki Okada Norimichi Tsumura
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.8, no.1, pp.49-59, 2020 (Released:2020-01-01)
参考文献数
35
被引用文献数
10

The market size of online video advertising is expanding rapidly along with the spread of smartphones and social media. In this study, we estimate advertising effectiveness in the natural environment using online data collection and the remote measurement of webcam facial expressions and physiological responses. We collected 4, 108 videos of the faces of 411 Japanese people who were watching the video advertisement in their natural environment via the Internet. Facial expression and physiological responses such as heart rate and gaze were remotely measured by analyzing facial videos. We found that the accuracies of ad liking and purchase intent prediction are better when various acquired features are combined and machine learning is used than when only single-mode features are used. In addition, we aim to improve prediction accuracy by clustering the personality of the subjects and designing an estimation model for each personality.
著者
Wei-Ta Chu Hideto Motomura Norimichi Tsumura Toshihiko Yamasaki
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.7, no.2, pp.60-67, 2019 (Released:2019-04-01)
参考文献数
72
被引用文献数
3

With the advances in digital media processing technologies and the tremendous growth in the amount of digital media that have been created, new artworks are becoming possible and drawing much attention from researchers, industry, and consumers. A related emerging research area is the evaluation of such multimedia artworks by machine learning techniques. We call this research area “attractiveness computing.” Attractiveness computing is made possible by the great accumulation of such multimedia artworks and of consumers' responses. In this paper, we review existing research on multimedia artworks analysis and attractiveness computing.
著者
Cathal Gurrin Klaus Schoeffmann Hideo Joho Andreas Leibetseder Liting Zhou Aaron Duane Duc-Tien Dang-Nguyen Michael Riegler Luca Piras Minh-Triet Tran Jakub Lokoč Wolfgang Hürst
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.7, no.2, pp.46-59, 2019 (Released:2019-04-01)
参考文献数
31
被引用文献数
69

The Lifelog Search Challenge (LSC) is an international content retrieval competition that evaluates search for personal lifelog data. At the LSC, content-based search is performed over a multi-modal dataset, continuously recorded by a lifelogger over 27 days, consisting of multimedia content, biometric data, human activity data, and information activities data. In this work, we report on the first LSC that took place in Yokohama, Japan in 2018 as a special workshop at ACM International Conference on Multimedia Retrieval 2018 (ICMR 2018). We describe the general idea of this challenge, summarise the participating search systems as well as the evaluation procedure, and analyse the search performance of the teams in various aspects. We try to identify reasons why some systems performed better than others and provide an outlook as well as open issues for upcoming iterations of the challenge.
著者
Yugo Sato Tsukasa Fukusato Shigeo Morishima
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.7, no.2, pp.68-79, 2019 (Released:2019-04-01)
参考文献数
60
被引用文献数
1

This paper presents an interactive face retrieval framework for clarifying an image representation envisioned by a user. Our system is designed for a situation in which the user wishes to find a person but has only visual memory of the person. We address a critical challenge of image retrieval across the user's inputs. Instead of target-specific information, the user can select several images that are similar to an impression of the target person the user wishes to search for. Based on the user's selection, our proposed system automatically updates a deep convolutional neural network. By interactively repeating these process, the system can reduce the gap between human-based similarities and computer-based similarities and estimate the target image representation. We ran user studies with 10 participants on a public database and confirmed that the proposed framework is effective for clarifying the image representation envisioned by the user easily and quickly.
著者
Hideki Kakeya Atsushi Yoshida Bin Yang Yukio Oshiro Nobuhiro Ohkohchi
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.6, no.1, pp.11-17, 2018 (Released:2018-01-01)
参考文献数
20
被引用文献数
2

We present a liver surgery simulator using full-HD autostereoscopic displays. We have developed two kinds of autostereoscopic displays to keep on showing a full-HD 3D image to a viewer who moves freely in front of the display. One is a 3D display based on time-division multiplexing directional backlight and the other is a 3D display based on time-division multiplexing parallax barrier. We have applied the developed simulator using the 3D displays with different specifications to the education of medical students. The result of the questionnaires suggests that 3D visualization is effective and that reduction of crosstalk plays an important role to promote medical use of 3D displays.
著者
Changyo Han Takeshi Naemura
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.7, no.1, pp.11-19, 2019 (Released:2019-01-01)
参考文献数
19
被引用文献数
4

This paper introduces BumpMarker: a 3D-printed tangible marker that can perform simultaneous tagging, position tracking, and weight measurement of objects on pressure sensor sheets. The markers baseplate features several pins (raised dots) whose locations encode embedded information. A matrix pressure sensor sheet captures the pressure map of a marker-attached object on a sheet. The embedded data and object weight can be retrieved by processing the pressure map. We propose our design to achieve robust detection of the pins. We also show that our system has the ability to monitor weight changes in tagged objects. Through a series of evaluations, we investigate the technical feasibility of BumpMarker.
著者
Shinya Ichino Takezo Mawaki Akinobu Teramoto Rihito Kuroda Shunichi Wakashima Tomoyuki Suwa Shigetoshi Sugawa
出版者
The Institute of Image Information and Television Engineers
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.6, no.3, pp.163-170, 2018 (Released:2018-07-01)
参考文献数
37
被引用文献数
5

Random telegraph noise (RTN) that occurs at in-pixel source follower (SF) transistors and column amplifier is one of the most important issues in CMOS image sensors (CIS) and reducing RTN is a key to the further development of CIS. In this paper, we clarified the influence of transistor shapes on RTN from statistical analysis of SF transistors with various gate shapes including rectangular, trapezoidal and octagonal structures by using an array test circuit. From the analysis of RTN parameter such as amplitude and the current-voltage characteristics by the measurement of a large number of transistors, the influence of shallow trench isolation (STI) edge on channel carriers and the influence of the trap location along source-drain direction are discussed by using the octagonal SF transistors which have no STI edge and the trapezoidal SF transistors which have an asymmetry gate width at source and drain side.
著者
Hidehiko Shishido Yoshinari Kameda Yuichi Ohta Itaru Kitahara
出版者
映像情報メディア学会
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.5, no.3, pp.110-120, 2017 (Released:2017-07-01)
参考文献数
13
被引用文献数
5

This paper introduces a method that uses multiple-view videos to estimate the 3D position of a badminton shuttle that moves quickly and anomalously. When an object moves quickly, it is observed with a motion blur effect. By utilizing the information provided by the shape of the motion blur region, we propose a visual tracking method for objects that have an erratic and drastically changing moving speed. When the speed increases tremendously, we propose another method, which applies the shape-from-silhouette technique, to estimate the 3D position of a moving shuttlecock using unsynchronized multiple-view videos. We confirmed the effectiveness of our proposed technique using video sequences and a CG simulation image set.
著者
Tetsuya Watanabe Hirotsugu Kaga
出版者
映像情報メディア学会
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.5, no.1, pp.2-7, 2017 (Released:2017-01-01)
参考文献数
22
被引用文献数
3

To determine the optimum size of a braille font, we conducted an experiment in which a popular Japanese braille font was printed at various sizes on capsule paper and read and rated by late blind people. The results show that braille printed at 16 to 19-point sizes was read faster and rated higher than that printed at smaller or larger sizes. These optimum sizes mostly coincide with those found for young congenitally blind people. A new finding was that many reading errors that stemmed from mistaking the range of braille cells were observed at larger sizes, 20 to 22-point sizes. This means that enlarging the font size is not necessarily beneficial for late blind people and optimum sizes should be strictly selected when doing so.
著者
Ali S. Razavian Josephine Sullivan Stefan Carlsson Atsuto Maki
出版者
映像情報メディア学会
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.4, no.3, pp.251-258, 2016 (Released:2016-07-01)
参考文献数
39
被引用文献数
196

This paper provides an extensive study on the availability of image representations based on convolutional networks (ConvNets) for the task of visual instance retrieval. Besides the choice of convolutional layers, we present an efficient pipeline exploiting multi-scale schemes to extract local features, in particular, by taking geometric invariance into explicit account, i.e. positions, scales and spatial consistency. In our experiments using five standard image retrieval datasets, we demonstrate that generic ConvNet image representations can outperform other state-of-the-art methods if they are extracted appropriately.
著者
George Awad Cees G. M. Snoek Alan F. Smeaton Georges Quénot
出版者
映像情報メディア学会
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.4, no.3, pp.187-208, 2016 (Released:2016-07-01)
参考文献数
36
被引用文献数
1 20

Semantic indexing, or assigning semantic tags to video samples, is a key component for content-based access to video documents and collections. The Semantic Indexing task has been run at TRECVid from 2010 to 2015 with the support of NIST and the Quaero project. As with the previous High-Level Feature detection task which ran from 2002 to 2009, the semantic indexing task aims at evaluating methods and systems for detecting visual, auditory or multi-modal concepts in video shots. In addition to the main semantic indexing task, four secondary tasks were proposed namely the “localization” task, the “concept pair” task, the “no annotation” task, and the “progress” task. It attracted over 40 research teams during its running period. The task was conducted using a total of 1,400 hours of video data drawn from Internet Archive videos with Creative Commons licenses gathered by NIST. 200 hours of new test data was made available each year plus 200 more as development data in 2010. The number of target concepts to be detected started from 130 in 2010 and was extended to 346 in 2011. Both the increase in the volume of video data and in the number of target concepts favored the development of generic and scalable methods. Over 8 millions shots×concepts direct annotations plus over 20 millions indirect ones were produced by the participants and the Quaero project on a total of 800 hours of development data. Significant progress was accomplished during the period as this was accurately measured in the context of the progress task but also from some of the participants' contrast experiments. This paper describes the data, protocol and metrics used for the main and the secondary tasks, the results obtained and the main approaches used by participants.
著者
Kiya Hitoshi Dobashi Toshiyuki
出版者
一般社団法人 映像情報メディア学会
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.4, no.1, pp.2-9, 2016

This paper addresses a unified tone mapping operation (TMO) for HDR images with fixed-point arithmetic. A TMO generates a low dynamic range (LDR) image from a high dynamic range (HDR) image by compressing its dynamic range. A unified TMO can perform tone mapping for various HDR image formats with a single common TMO. Since HDR images are generally expressed in a floating-point data format, a TMO also deals with floating-point data even though resulting LDR images have integer data. As a result, conventional TMOs require many resources such as computational and memory cost. To reduce the resources, the method which allows to replace a floating-point number with two 8-bit integer numbers was proposed. However, this method has a limitation of available input HDR image formats. The proposed unified TMO can be applied for various formats such as the RGBE and the OpenEXR by introducing an intermediate format. Moreover, the method can conduct all calculations in the TMO with fixed-point arithmetic. By using both integer data and fixed-point arithmetic, the method reduces not only the memory cost but also the computational cost. The experimental and evaluation results show the proposed method reduces the computational and memory cost, and gives almost same quality of LDR images, compared to the conventional method with floating-point arithmetic.
著者
Shingo Nagasaka Yuki Uranishi Shunsuke Yoshimoto Masataka Imura Osamu Oshiro
出版者
映像情報メディア学会
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.3, no.4, pp.279-286, 2015 (Released:2015-10-01)
参考文献数
12
被引用文献数
3

This paper proposes a system that provides the sensation of touching virtual objects in a mobile touch panel using a retractable stylus and the mobile touch panel. The proposed system provides a sensation like the stylus is being inserted into the monitor, and that the user is actually touching the object in the screen when the user pushes a retractable stylus downward on the display. A DC motor is mounted in the retractable stylus, and this motor shrinks the length of the stylus based on feedback from a pressure sensor in the tip of the stylus. When the tip of the virtual stylus touches a virtual object, a voice coil motor in the stylus oscillates according to the surface of the virtual object. So the user experiences a sensation like touching the object on the monitor by using the proposed system.
著者
Katsuki Kobayashi Takahiro Ogawa Miki Haseyama
出版者
映像情報メディア学会
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.1, no.4, pp.333-342, 2013 (Released:2013-10-01)
参考文献数
24
被引用文献数
5

This paper presents a new evaluation criterion for visualization of image search results based on the feature integration theory. This criterion is derived by combining two elements, visual saliency on visualization and grouping degree of similar images. Visual saliency, which is calculated from the feature integration theory, on visualization of image search results enables representation of users' attention, which is closely related to the effectiveness of finding images. Furthermore, since users perceive similar images that are close to each other as one group, grouping degree of similar images enables evaluation of the effectiveness when users find images similar to a desired image. Therefore, by combining visual saliency on visualization and grouping degree of similar images, we can derive the novel criterion and evaluate the effectiveness of visualization of image search results.
著者
Sho Takahashi Miki Haseyama
出版者
映像情報メディア学会
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.1, no.3, pp.220-225, 2013 (Released:2013-07-01)
参考文献数
13
被引用文献数
1 2

An Active grid-based method for estimating pass regions from broadcast soccer videos is presented in this paper. It is assumed that the pass region has a high probability of the pass succeeding. In soccer matches, players discover pass regions based on previous and current player positions. In conventional methods, pass regions are estimated by applying Active Net to only a single frame of a soccer video. In the proposed method, Active grid is applied to three-dimensional data by which frames of the soccer video are connected with the temporal dimension. The proposed method then realizes robust estimation of pass regions based on multiple frames of player positions. The proposed method was applied to actual TV programs to verify its effectiveness.
著者
Soh Yoshida Hiroshi Okada Takahiro Ogawa Miki Haseyama
出版者
映像情報メディア学会
雑誌
ITE Transactions on Media Technology and Applications (ISSN:21867364)
巻号頁・発行日
vol.1, no.3, pp.237-243, 2013 (Released:2013-07-01)
参考文献数
19

This paper presents a new method to improve performance of SVM-based classification, which contains a target object detection scheme. The proposed method tries to detect target objects from training images and improve the performance of the image classification by calculating the hyperplane from the detection results. Specifically, the proposed method calculates a Support Vector Machine (SVM) hyperplane, and detects rectangular areas surrounding the target objects based on the distances between their feature vectors and the separating hyperplane in the feature space. Then modification of feature vectors becomes feasible by removing features that exist only in background areas. Furthermore, a new hyperplane is calculated by using the modified feature vectors. Since the removed features are not part of the target object, they are not relevant to the learning process. Therefore, their removal can improve the performance of the image classification. Experimental results obtained by applying the proposed methods to several existing SVM-based classification method show its effectiveness.