文献一覧: 情報処理学会研究報告コンピュータビジョンとイメージメディア(CVIM) (雑誌)

1 0 0 0 テンソルと多視点幾何

著者: 佐藤淳
出版者: 一般社団法人情報処理学会
雑誌: 情報処理学会研究報告コンピュータビジョンとイメージメディア(CVIM) (ISSN:09196072)
巻号頁・発行日: vol.2007, no.42, pp.33-42, 2007-05-14
参考文献数: 32
被引用文献数: 2

コンピュータビジョンでは、90年代に多視点幾何においてテンソル表記が用いられるようになって以来、様々な分野で用いられるようになり、今日ではグラフィックスにおける表現手段として、また認識の一手法としても用いられつつある。本稿では、テンソルの基礎となる多重線形性や多重線形性の上で成り立つ多重線形拘束について解説し、これらが今日の多視点幾何の理論においてどのように応用されているかを示す。テンソルに基づく情報表現や多重線形拘束は、多視点幾何に限らずコンピュータビジョンの多くの分野で応用可能な考え方であることから、多くの読者の参考となることを期待する。In computer vision, the tensor notation has been used for representing the multiple view geomery since 1990s, and now it is used in various fields, such as graphics representation, image recognition, etc. In this paper, I explain the basics of the tensor and its properties, such as multilinearity and tensor product, and show how the tensor is used in the multiple view geometry. Since the tensor representation is useful in various fields in computer vision, I hope this paper will be a good referece for many readers.

2013-08-20 15:45:43
1 + 1 Twitter

https://ci.nii.ac.jp/naid/110006345158

1 0 0 0 基底反射特性モデルの線形結合による双方向テクスチャ関数と物体形状の同時復元の検討

著者: 関口真右田剛史尺長健
出版者: 一般社団法人情報処理学会
雑誌: 情報処理学会研究報告コンピュータビジョンとイメージメディア(CVIM) (ISSN:09196072)
巻号頁・発行日: vol.2005, no.18, pp.189-196, 2005-03-04
参考文献数: 9

物体表面の反射特性の表現法として双方向テクスチャ関数(BTF)が提案されている.厳密なBTFの獲得はパラメータの自由度が高く,困難である.そこで本稿では,パラメータの自由度を減らすために,基底反射モデルの線形結合による表現方法について述べ,これを用いた反射特性と物体形状の同時復元について検討する.基底モデルにTorrance-Sparrowモデルを利用して,反射特性の表現に必要なパラメータ(基底モデル・結合重み・形状・光源位置)の推定を行う.推定するパラメータ数が十万を超える大規模非線形最適化問題であるが,共役勾配法を拡張したBDCG法によって解くことができる.陶器を対象物体とし実験を行い,手法の有効性を確認した.Synthesizing photo-realistic images requires realistic models for geometry and photometry. We address an acquisition and representation method for a Bidirectional Texture Function which is a 2D array of functions representing appearance changes of a texture with respect to changes in lighting condition. We discuss a method for simultaneous estimation of the model parameters and the object shape from real images of potteries

2013-02-22 10:50:23
1 はてなブックマーク

https://ci.nii.ac.jp/naid/110002694829

1 0 0 0 合成開口撮影法によるデフォーカスコントロール

著者: 楠本夏未日浦慎作佐藤宏介
出版者: 一般社団法人情報処理学会
雑誌: 情報処理学会研究報告コンピュータビジョンとイメージメディア(CVIM) (ISSN:09196072)
巻号頁・発行日: vol.2008, no.36, pp.85-92, 2008-05-01
参考文献数: 8
被引用文献数: 2

撮影後に写真の焦点を変化させ,ぼけ部分の描写を制御する研究は多く行われているが,どれも特別な装置,または撮影法を必要とする.また,一般に普及している小型デジタルカメラでは十分な大きさのぼけを得ることが出来ず,主要被写体を引き立たせるような作画が出来ない.そこで本研究では,小型デジタルカメラのみを使用し,位置・姿勢が未知の複数視点から撮影を行って得た画像群をもとにユーザの意図した「ぼけ」を含む画像を作成する手法を提案する.更に,合成時にぼけ像の大きさや形状,重み付けを変化させることで,様々な「ぼけ味」を作り出す手法も提案する.また,以上のようなシステムで作成したぼけ画像を評価する.There are a lot of researches for controlling the focus and blur of previously taken pictures. However, all of these researches require some special devices or techniques for taking pictures. Besides, enough size of blur can not be created with an ordinary compact digital camera, so it is hard to take pictures with impressively attracting main subject. Therefore, In this paper, we propose a method to create the image with demanded blur from the set of images taken by a compact digital camera placed at unknown viewpoints. Moreover, it is possible to produce various taste of blur by changing size, shape or weight of the blur kernel. In addition, we evaluated the quality of blurred images made by our system.

2013-02-22 01:15:05
1 + 0 Twitter

https://ci.nii.ac.jp/naid/110006791926

1 0 0 0 メディア認識結果へのルール適用によるメタデータの自動生成とダイジェスト配信サービスへの適用

著者: 桑野秀豪山田智一川添雄彦
出版者: 一般社団法人情報処理学会
雑誌: 情報処理学会研究報告コンピュータビジョンとイメージメディア(CVIM) (ISSN:09196072)
巻号頁・発行日: vol.2006, no.25, pp.455-460, 2006-03-17
被引用文献数: 5

ダイジェスト映像配信サービス向けのメタデータの自動生成システムとモバイル端末向けのメタデータアプリケーションを紹介する。開発したメタデータ生成システムは映像・音声などの複数のメディア認識結果にルールを適用することで、番組中の重要なシーンを自動抽出し、抽出したシーンを予め決められた時間長のダイジェスト映像として自動編集する。また、モバイル端末上にパラパラ漫画モードなど番組ダイジェストを速覧できるアプリケーションソフトを実装した。開発システムを用いて、野球、サッカーなどのライブスポーツ番組に対し、ダイジェスト映像を自動編集した後、モバイル端末への即時配信を実現し、システムの有効性を確認した。This paper proposes a automatic metadata generation system for digest video distribution service. The system extracts significant scenes by applying heuristic rules to the results of media analyses. The system can also automatically produce digest videos of the desired duration. We also developed the novel application of digest viewing for mobile TV service. The application provides multiple viewing modes such as text mode, video comic mode, and digest video mode. We implemented the application on a mobile phone and can demonstrate the sports digest distribution service.

2013-01-11 20:00:04
1 + 0 Twitter

https://ci.nii.ac.jp/naid/110004788384

1 0 0 0 行動履歴を反映させた適応的環境属性を伴う三次元人物追跡

著者: 杉村大輔小林貴訓佐藤洋一杉本晃宏
出版者: 一般社団法人情報処理学会
雑誌: 情報処理学会研究報告コンピュータビジョンとイメージメディア(CVIM) (ISSN:09196072)
巻号頁・発行日: vol.2006, no.115, pp.171-178, 2006-11-10
被引用文献数: 2

パーティクルフィルタを用いた人物追跡技術は,これまでに様々な手法が提案されその有用性が報告されている.一方で,追跡の安定化に有効な画像以外の手がかりとして,環境属性情報がある.環境属性情報とは,対象シーン内の人物の存在可能性を意味し,これを追跡に利用することで人物が存在しにくい箇所への仮説の生成を抑制することができる. したがって環境属性情報は効率的な仮説の生成に役立つと考えられる.この指標には,障害物の配置などの物理的な制約に依るもの,人物の行動履歴に依るものがある.本研究で提案する手法では,特に後者に焦点を当て,環境属性情報の獲得と追跡の枠組みへの統合を行う.具体的には,環境属性情報を混合正規分布で表現し,オンラインEMアルゴリズムにより人物行動履歴を逐次的に学習することで獲得する.さらに,ICONDENSATION の考えに基づき,追跡の枠組みに統合する. 実環境における実験により本手法の有効性を確認した.Various tracking techniques based on particle filters have been proposed. To enhance the robustness of tracking, it is very significant to consider environmental attributes which represent an existing probability of people in a scene. They can be used for an effective hypothesis generation by considering the area where people are likely to exist. The environmental attributes can be considered in two aspects: the one is based on the physical configuration of objects in a scene and the other is based on the history of people activities. In this paper, we establish the history based environmental attributes that are updated by people tracking result everyframe using the online EM algorithm. Furthermore, we incorporate them into our tracking algorithm by using the ICONDENSATION framework. Our experimental results demonstrate the effectiveness of our method.

2013-01-06 09:29:00
1 知恵袋

https://ci.nii.ac.jp/naid/110005716410

1 0 0 0 実時間手領域追跡によるユーザインタフェースの実装

著者: 木俣孝一若間俊旭岡田至弘
出版者: 一般社団法人情報処理学会
雑誌: 情報処理学会研究報告コンピュータビジョンとイメージメディア(CVIM) (ISSN:09196072)
巻号頁・発行日: vol.2008, no.3, pp.49-54, 2008-01-17

本研究では、複数対象の手と顔の位置関係を重回帰分析によってモデル化する手法を提案する。これにより、Mean ShiftやCAMSHIFTを用いた追跡アルゴリズムの課題である類似した色の対象の重なりや一時的な遮蔽による追跡失敗を回復・修正できることを示す。また、手のジェスチャを用いたインタフェースの例として、手の動きに合わせて表示された画像の操作が可能なバーチャルウォールへ実装について述べる。This paper describes the way that the position of the hand and the face is modeled by Multiple Regression Analysis. Mean Shift Algorithm and CAMSHIFT Algorithm can't track the object that occluded by the other objects. But, this way modify tracking point that loses sight of the object. In addition, we implemented the Virtual Wall as a example of interface. It operates a image by user's gestures.

2013-01-06 09:29:00
1 知恵袋

https://ci.nii.ac.jp/naid/110006623131

1 0 0 0 複数のステレオカメラを用いた人物形状モデルの復元

著者: 宮城玲子日浦慎作佐藤宏介
出版者: 一般社団法人情報処理学会
雑誌: 情報処理学会研究報告コンピュータビジョンとイメージメディア(CVIM) (ISSN:09196072)
巻号頁・発行日: vol.2004, no.40, pp.93-100, 2004-05-06

侵入者の追跡など一般の日常シーンにおける画像処理では様々な未知の対象を即座に取り扱う必要があるため,対象のモデルをオンライン的に構築する必要がある.そこで我々は複数のステレオカメラを用いて対象の全周形状モデルを得ることを目的として研究を行った.対象物体が計測に適した位置に移動したことを知るため,また対象を背景から切り出すために視差に基づく背景差分法とテンプレートマッチングにより対象の位置を求めた.テクスチャが乏しい面や鏡面反射を持つ面などについては視差が安定に求められないことがあるため,背景の視差の変動を調べ利用することで安定化した.また基準物体を用いて2台のカメラ同士の位置関係を求め距離画像の位置あわせを行った.In this paper, we propose a measurement system of human body with multiple stereo cameras. This system focuses to an application of surveillance and monitoring in daily life. Prior to the measurement, the region of the object is extracted using background subtraction of disparity value, and then the position of the object is tracked using two-stage template matching. To remove the noise of background subtraction caused by specular reflection or lack of texture, statistical model of background is used. Calibration of two cameras is achieved using reference object and the shape and texture model of human face is correctly aligned as shown in experimental result.

2013-01-06 09:29:00
1 知恵袋

https://ci.nii.ac.jp/naid/110002664140

1 0 0 0 テンプレートの可変分割と統合による人物の動作追跡

著者: 齊藤慎也佐治斉
出版者: 一般社団法人情報処理学会
雑誌: 情報処理学会研究報告コンピュータビジョンとイメージメディア(CVIM) (ISSN:09196072)
巻号頁・発行日: vol.2004, no.91, pp.33-39, 2004-09-10

人物全体をテンプレートとしたマッチングによる人物の動作追跡では,テンプレート内に含まれる背景領域や人物の姿勢変化などにより誤追跡となりやすい.そこで本研究では,人物全体を囲む領域をテンプレートとして抽出した後,人物の形状に合わせてテンプレート領域を分割・統合してマッチングを行う動作追跡システムを提案する.まず,背景差分を利用して人物領域を抽出し,その領域を囲むようなテンプレートを作成する.そして,テンプレート内に背景領域と人物領域が混在している場合は,テンプレートを4つのブロックに分割し,逆に背景領域または人物領域のみの場合は分割しない.この処理を繰り返す.次に,分割されたブロックのうち,背景のみの領域を含むものを削除し,人物の中心に近いものは統合する.実画像上で抽出されたテンプレートに対してマッチングを行った結果,正しい追跡結果が得られた.Human motion is often tracked by matching a template whose area surrounds the human region. However, this method causes mismatching because of the background region and the human form change. We propose a new tracking method using the template matching. In our method, a template is divided and integrated fitting to the human form. First, we extract a template around a human region by computing the difference between the background image and the current image frame. Next, if the template area includes both the background region and the human region, we divide the template into four blocks. If the template area includes only the background region or only the human region, we do not divide the template. We repeat this process. Finally, we delete the blocks including the background area only and integrate the blocks near the human center. We obtain the correct matching result by using the template extracted on the real image.

2013-01-06 09:29:00
1 知恵袋

https://ci.nii.ac.jp/naid/110002664178

1 0 0 0 マルチテンプレートの挙動解析に基づく状態推定を用いた人物追跡の研究

著者: 伊藤雅人福添孝明水戸大輔渡邊睦
出版者: 一般社団法人情報処理学会
雑誌: 情報処理学会研究報告コンピュータビジョンとイメージメディア(CVIM) (ISSN:09196072)
巻号頁・発行日: vol.2005, no.88, pp.145-152, 2005-09-06
被引用文献数: 1

人物追跡中の複数テンプレートの挙動を解析することにより,現在の状態(安定追跡中,環境物体による遮蔽,人物同士のすれ違い,立ち止まり静止,着席)を推定し,この結果に基づいて安定な追跡を実現する手法について述べる.個々のテンプレートにはカルマンフィルタによる線形予測機能を付与し,また探索範囲などを相互に制約するように設計した.本提案方式をPC上のソフトウェアとして実装し,アクティブカメラを用いて屋内における特定人物追跡の実験を行い,有効性を確認した.The authors propose a new tracking method based on the estimation of current status,such as,"stable tracking" "occlusion by environmental object" "passing each other" "stopping" "seating" by analysing multi-template trajectories.The alignment prediction function by the Kalman filter is given to each template, and search ranges of each template are mutually restricted. This proposed method was implemented as software on PC. Experimental results of specific person tracking in indoor by using an active camera have shown the effectiveness of the proposed method.

2013-01-06 09:29:00
1 知恵袋

https://ci.nii.ac.jp/naid/110002702279

1 0 0 0 顔認識のための3次元形状復元に基づく任意視点顔画像生成

著者: 菅谷佳子安藤慎吾鈴木章小池秀樹
出版者: 一般社団法人情報処理学会
雑誌: 情報処理学会研究報告コンピュータビジョンとイメージメディア(CVIM) (ISSN:09196072)
巻号頁・発行日: vol.2008, no.27, pp.73-78, 2008-03-10

単眼カメラで姿勢変動にロバストな顔認識を行うためには、任意視点の顔画像を認識辞書として登録する手法が有効である。本稿では、連続的に撮影された複数視点の顔画像から顔の 3 次元形状復元を行い、任意視点の顔画像を生成する手法を提案する。本手法では、多視点画像からの顔の特徴点追跡に Active Appearance Model を使用し、因子分解法を用いて3次元形状復元することでユーザーごとの3次元顔モデルを生成する。得られた3次元顔モデルから任意視点画像を生成するため、大きな向き変動のある顔画像も生成することが可能である。本稿では、生成した任意視点顔画像と様々な方向から実際に撮影された顔画像とで評価実験を行い、本手法の有効性を示す。Registrating multiple view face images is an effective way of improving the robustness of face recognition against pose variations. This paper proposes a method that generates multiple view face images based on 3D face reconstruction from one image sequence. The proposed method constructs a 3D face model from the Active Appearance Model in the facial feature tracking stage and the factorization method for 3D shape reconstruction. The constructed 3D face model allows many face images with large pose variation to be generated easily. Experiments using the generated face images and real images show the effectiveness of the proposed method.

2012-07-31 22:45:22
1 + 0 Twitter

https://ci.nii.ac.jp/naid/110006820227

1 0 0 0 偏光情報を利用した陰影領域の分割

著者: 佐藤智金森克洋
出版者: 一般社団法人情報処理学会
雑誌: 情報処理学会研究報告コンピュータビジョンとイメージメディア(CVIM) (ISSN:09196072)
巻号頁・発行日: vol.2008, no.115, pp.201-206, 2008-11-20

被写体の法線情報や被写体間の相対関係,光源情報を取得するために, attached shadow と cast shadow からなる陰影情報が有効である. attached shadow 情報からは被写体の法線情報, cast shadow 情報からは被写体間の位置関係や光源情報を取得することができる.しかし,陰影情報から attached shadow と cast shadow を分割する手法はあまり研究されていない.本研究では,偏光情報を利用することで,光源移動を伴わないパッシブな撮像手法により,陰影領域を attached shadow と cast shadow に分割する手法を提案する.For object information acquisition, for example, normal information, spatial arrangement of the object relative to another object and illumination distribution, shadow information is effective. Shadow consists attached shadow and cast shadow. Using attached shadow, we can estimate object's normal information. Cast shadow is informative about spatial arrangement and illumination distribution. However, little study is known about classification of shadow into attached shadow and cast shadow. In this paper, we propose the passive method that classifies shadow into attached shadow and cast shadow using polarization information.

2012-07-23 18:40:38
1 + 0 Twitter

https://ci.nii.ac.jp/naid/110007098104

1 0 0 0 実物体と映り込みの分離方法

著者: 熊木健二山村毅田中敏光大西昇
出版者: 一般社団法人情報処理学会
雑誌: 情報処理学会研究報告コンピュータビジョンとイメージメディア(CVIM) (ISSN:09196072)
巻号頁・発行日: vol.1995, no.108, pp.31-37, 1995-11-16
被引用文献数: 5

本研究では,ガラスなどの透明物体に映り込んだ虚像と,その背後にある物体を分離する手法を提案する.一般に,ガラスの表面で反射する光は,ガラスを透過してくる光よりも強く偏光しているので,偏光フィルタを適切な角度(偏光角)で用いることにより,映り込み物体を除去することができる.偏光角は,ガラス及び映り込みを生じさせる物体のカメラに対する位置と姿勢により求めることができるが,これは,容易でない.そこで,偏光フィルタを少しずつ回転させながら取り込んだ複数枚の画像から各画素の最大値・最小値を求めることにより,実物体と映り込み物体の分離を行う.実画像に対して本手法を適用し,その有効性を示す.This paper proposes a method for separating real objects behind glass and virtual ones reflected on it. Generally speaking, light is more polarized when reflected on glass than when transmitted through it. This optical property allows us to eliminate virtual objects by a polarizing filter set to a proper angle, called "polarizing angle". This angle is calculated from the relative orientations of the glass and the virtual objects to a camera. However it is usually difficult to implement such calculation. Therefore, we use a series of images taken by rotating the filter. Obtaining the minimum and maximum images, we can separate real and virtual objects. Experiment with real scenes is presented to demonstrate the effectiveness of the proposed method.

2012-05-25 22:57:55
1 + 0 Twitter

https://ci.nii.ac.jp/naid/110002930323

1 0 0 0 囲碁対局テレビ番組からの棋譜自動生成システム

著者: 林山剛久柳井啓司野下浩平
出版者: 一般社団法人情報処理学会
雑誌: 情報処理学会研究報告コンピュータビジョンとイメージメディア(CVIM) (ISSN:09196072)
巻号頁・発行日: vol.2002, no.34, pp.161-168, 2002-05-09
被引用文献数: 4

テレビ放送からの自動情報抽出の研究の一例として、本研究では、囲碁対局テレビ番組から画像認識によって自動的に対局棋譜を生成するシステムを提案する。システムは、囲碁対局番組の画面画像を取り込み、画面中の囲碁盤の位置を検出し、置石(囲碁盤上に置かれている石)を検出して、囲碁対局の棋譜の自動生成を行う。対局中の画面画像を認識する際には、対局画面とそれ以外の対局には直接無関係な画面の識別や、囲碁盤上に現れる指し手の手や頭などの置石以外の物体の除去などの、囲碁対局テレビ番組の特有の問題点に対する対処を行う。我々は、実装したシステムを用いて、実際の11対局分の囲碁対局番組に対して実験を行い、96%の適合率と83%の再現率を得た。In this paper, we present an image-recognition system for generating Go-kifu (i.e., Go-records) automatically from games played by human professionals on a TV program. The system takes screen images of a TV program every several seconds, determines the position of the Go-board, and detects Go-stones played on the board. The system deals with several problems encountered on the TV program. It has to remove several types of noise like players' heads or hands, and discriminate a scene of the game from other various scenes like conversations by commentators. We have implemented the system, and performed experiments for eleven games on the TV program, whose results show high performance of the system. In particular, the precision and recall rate are 96% and 83%, respectively.

2012-04-08 21:21:03
1 はてなブックマーク

https://ci.nii.ac.jp/naid/110002674693

1 0 0 0 閾下/閾上視線手がかりによる注意シフト

著者: 川畑秀明関口達彦
出版者: 一般社団法人情報処理学会
雑誌: 情報処理学会研究報告コンピュータビジョンとイメージメディア(CVIM) (ISSN:09196072)
巻号頁・発行日: vol.2008, no.82, pp.37-42, 2008-08-29

視線はコミュニケーションにおいて他者の心を理解する上で重要な役割を果たしている。これまで,他者の視線方向が示す位置方向へ自動的 (反射的) な注意シフトを起こすことが心理学的に知られており,視線が向いた方向に提示されたターゲットに対して反応が促進される。本研究では,線画顔刺激において刺激提示時間を変数に,視線の意識的気づきに求められる時間的閾値を測定し,線画顔の瞳の位置を横線に置き換えた刺激,目の位置を矢印に置き換えた刺激の測定とともに比較・検討した (実験 1) 。その結果,顔刺激<矢印刺激<横線刺激の順に位置方向検出に必要な時間は長くなることが示された。次に,線画顔刺激のみを用いて,顔刺激の刺激提示時間を変数に顔刺激の後に提示されるターゲットの検出について,提示時間が閾上/閾下である場合とで検討した (実験 2) 。その結果,視線が意識的に気づく条件でも,気づかない提示条件でも,ターゲットに対する自動的シフトは起こらないが,むしろターゲットが視線手がかりに対して無効な場所に提示される場合において,抑制的な働きを持つことが明らかになった。Facial gaze seems to be very important for understanding others' mind in human communication. Automatic or reflexive attentional shift in response to another individual's gaze direction has been reported. In this study, we examined temporal thresholds to be seen whether direction cue in line drawing stimuli shows right or left side, in face gaze cue , horizontal line cue, and arrow cue, by presenting in different stimulus durations (Experiment 1). This showed the temporal limit in face gaze was much faster than one in line cue and arrow cue. Also, we tried to determine temporal point supraliminally or subliminally that the cue direction can be perceived with or without awareness. We next examined whether automatic attentional shift to a target can occur with or without awareness of the gaze direction, as a function of presentation duration of face gaze cue (Experiment 2). The reaction time needed to localize the target was consistently shorter for valid than invalid gaze cues, but the reaction times were almost constant both in subliminal and suplaliminal duration times.

2012-02-07 23:42:56
1 + 0 Twitter

https://ci.nii.ac.jp/naid/110006969326

1 0 0 0 顔らしさ分布を利用した顔検出手法

著者: 高塚皓正田中正行奥富正敏
出版者: 一般社団法人情報処理学会
雑誌: 情報処理学会研究報告コンピュータビジョンとイメージメディア(CVIM) (ISSN:09196072)
巻号頁・発行日: vol.2006, no.93, pp.73-80, 2006-09-08
被引用文献数: 2

顔検出はコンピュータビジョンにおいて,近年注目されている技術の一つであり,様々な研究が行なわれている従来の顔検出に関する研究の多くは,切り取られたサブウィンドウが顔かどうかを判別する識別器を改良することを主な目的としている.この識別器は顔検出を行なう際の?プロセスとして重要であるしかし,これらの識別器は各サプウィンドウに対して,独立に顔または非顔を判定するため,識別器の出力(顔らしさ)の高い非顔画像を誤検出してしまうことがよくある.本報告では,顔と非顔における顔らしさ分布の違いに着目し,この違いを陽に利用した新しい顔検出の枠組みについて提案する.提案手法では,顔らしさ分布を生成し,統合処理により顔と非顔の違いを強調することで,従来手法で誤検出していた非顔を正しく分類することが可能になる.実験では,テストデータセットと実画像を用いて,提案手法の有効性を確認したその結果,それぞれ20%と10%の検出率の向上が見られた.Face detection is a useful technique in computer vision. Many face detectors have been developed in the literature. Almost all approaches for face detection focus on the face detectors which classify a given subwindow into face or non-face. However, in face detection process, since the detectors also evaluate the scanned sub-windows independently, non-faces with high face likelihood are often misdetected. In this paper, we propose a novel face detection algorithm which explicitly uses difference of face likelihood distribution between faces and non-faces. The proposed algorithm can correctly classify the non-faces misdetected by the existing algorithm. The face likelihood distribution is generated and integrated to emphasize the difference between faces and nonfaces. Experiments with pre-scanned data set and real-world images show that the proposed algorithm improves the detection rate approximately by 20% and 10%, respectively.

2012-01-12 08:51:36
1 はてなブックマーク

https://ci.nii.ac.jp/naid/110004820676

1 0 0 0 画像の特徴点に共分散行列は本当に必要か?

著者: 金澤靖金谷健一
出版者: 一般社団法人情報処理学会
雑誌: 情報処理学会研究報告コンピュータビジョンとイメージメディア(CVIM) (ISSN:09196072)
巻号頁・発行日: vol.2001, no.23, pp.1-8, 2001-03-08
被引用文献数: 6

我々は従来より画像の特徴点の位置の不確定性を共分散行列によって表現し,それに基づいた最適化推定の手法を開発してきた.本稿では,まず従来から提案されている画像の濃淡値から共分散行列を計算する方法を統一的に定式化し,それが本当に特徴点の位置の精度を反映しているのかどうかを可変テンプレートマッチングによるサブ画素補正を行うことにより,実験的に検証する.そして,このような共分散行列を用いた場合に射影変換行列および基礎行列の最適計算の精度が向上するかどうかを調べる.これらの結果を画像間の対応づけのための半自動的システムへ応用する.We have explored various statistical optimization techniques based on covariance matrices that characterize the uncertainty of the positions of feature points in the images. We first describe how to compute the covariance matrix of a feature point from the gray levels by integrating existing methods. Then, we experimentally examine if thus computed covarinace matrices really reflect the accuracy of the feature positions. For this purpose, we observe the correlation between the feature covariance and the amount of subpixel correction resulting from variable template matching, using real images. We also test if the accuracy of computing the homography and the fundamental matrices from two images can be really improved by statistical optimization based on the covariance matrices. Finally, we apply our results to semi-automatic systems for matching two images.

2011-11-28 14:38:26
1 + 0 Twitter

https://ci.nii.ac.jp/naid/110002674625

1 0 0 0 感性協調に基づいたインタラクティブな音コミュニケーション環境の構築

著者: 西田正吾才脇直樹仲谷美江加藤博一
出版者: 一般社団法人情報処理学会
雑誌: 情報処理学会研究報告コンピュータビジョンとイメージメディア(CVIM) (ISSN:09196072)
巻号頁・発行日: vol.2003, no.88, pp.147-154, 2003-09-08
参考文献数: 31

近年,ヒューマンインタフェースの分野では,ヒューマンインタラクションやコミュニケーションを扱う研究が増加しており,その分析手法や支援のあり方に関する議論が盛んに行われるようになってきている.本稿では,「感性協調に基づいたインタラクティブな音コミュニケーション環境の構築」というテーマに対し、筆者らの研究グループで取り組んできた研究成果について紹介する.具体的には,感性協調による共感世界構築の枠組みについて述べた後,エージェントにおける仮想感性構築法,共通の場構築のためのインタフェース,感情表現を介したダンスと音楽の関連づけ等の要素技術について述べる.Human interaction and human communication are hot topics in the field of Human Interfaces recently. This paper deals with design and implementation of interactive sound communication environment based on Kansei coordination, by focusing on affective aspects of human interaction. Concretely, a framework to support Kansei coordination is discussed, and component technologies, such as realization of artificial Kansei in agent, interfaces for common platform and analysis of relations between dance and music etc., are introduced.

2011-10-26 16:19:35
1 はてなブックマーク

https://ci.nii.ac.jp/naid/110002664108

1 0 0 0 画像認識におけるカーネル学習法

著者: 西田健次栗田多喜夫
出版者: 一般社団法人情報処理学会
雑誌: 情報処理学会研究報告コンピュータビジョンとイメージメディア(CVIM) (ISSN:09196072)
巻号頁・発行日: vol.2005, no.38, pp.333-342, 2005-05-13

カーネル学習法は、非線形識別関数を効率よく構成する手法であり、カーネルトリックとも呼ばれている。サポートベクターマシンは、現在知られている多くのパターン認識手法の中でも認識性能の優れた手法であると考えられているが、カーネルトリックによって非線形識別関数を構成できるようになったことが、その性能向上に大きく貢献している。カーネル学習法とサポートベクターマシンに代表される線形識別手法を組み合わせることにより高性能な識別器を構成する事が可能になったが、未学習データに対する認識性能(汎化性能)を更に向上するためには変数選択などの手法が重要な役割を果たす。本稿では、サポートベクターマシンを中心にカーネル学習法について概説し、汎化性能向上のための変数選択手法などを紹介する。さらに、画像認識への応用例も紹介する。Kernel method, which is also called Kernel Trick, is known to be one of the best scheme to extend linear classfier systems to nonlinear classifier systems. Support vector machine (SVM) is recognized as one of the best models for two class classification among the many methods, since its performace is drastically improved by kernel trick. Although we can build a high performance classifier system with combination of kernel method and linear classification method such as SVM, feature selection is still important to obtain high performance for unlearned data. This paper reviews kernel methods centering on the SVM and introduces some feature selection methods. Some examples of applications for image understanding are also introduced.

2011-04-08 21:12:48
1 はてなブックマーク

https://ci.nii.ac.jp/naid/10016156970

1 0 0 0 直線的手ぶれ画像復元のためのPSFパラメータ推定手法

著者: 米司健一田中正行奥富正敏
出版者: 一般社団法人情報処理学会
雑誌: 情報処理学会研究報告コンピュータビジョンとイメージメディア(CVIM) (ISSN:09196072)
巻号頁・発行日: vol.2005, no.38, pp.47-52, 2005-05-12
被引用文献数: 8

画像撮影時の手ぶれや,対象が動くことによって画像にぶれが生じる.このぶれを等速直線運動で近似すると,ぶれを表すPSF(Point Spread Function)は幅と角度の2つのパラメータで表現することができる.劣化画像の振幅スペクトルは,PSFの幅と角度によって決まる方向と周期で0となる性質を持つ.この劣化画像の振幅スペクトルの周期性と方向性を検出することによって,PSFパラメータの幅ellと角度thetaを推定する.本論文では原画像の周波数特性によらず,劣化画像の振幅スペクトルの周期性と方向性をロバストに検出する手法を提案する.また,実画像実験を通して,提案手法の効果を確認した.An image is degraded by hand blurring or moving object. That degradation can be expressed by PSF(Point Spread Function). The PSF has two parameters of width and the angle, approximating the motion is uniform. An amplitude spectrum of blurred image has a feature based on PSF parameters. PSF parameters can estimate from this feature. This paper presents a new method to estimate PSF parameters from the amplitude spectrum of blurred image. The effect of the proposed method is confirmed by experiments.

2011-01-13 15:45:13
1 + 1 Twitter

https://ci.nii.ac.jp/naid/10016156466

1 0 0 0 全方位カメラを用いた複数方向の観測による歩容認証

著者: 杉浦一成槇原靖八木康史
出版者: 一般社団法人情報処理学会
雑誌: 情報処理学会研究報告コンピュータビジョンとイメージメディア(CVIM) (ISSN:09196072)
巻号頁・発行日: vol.2007, no.42, pp.187-194, 2007-05-15
被引用文献数: 2

本研究では、全方位カメラから得られる複数方向の歩容画像を用いた個人認証手法を提案する。最初に背景差分により全方位画像列からシルエットを抽出する。それをパノラマ展開し、時空間の歩容シルエットボリューム(GSV)を得る。次に GSV から算出した歩行周期に基づいてフーリエ解析を行い、周波数領域特徴を抽出する。また全方位カメラを用いることによる観測方向の変化を利用して、複数の基準方向を設定し、各基準方向と歩行周期を基に複数方向の特徴を抽出する。認証時には、入力と辞書シーケンスに対する同一方向同士の特徴間距離を算出し、それらを統合して照合を行う。最後に15人の被験者の5方向を含むシーケンスに対して個人認証実験を行い、本手法の有効性を確認した。We propose a method of gait identification based on multi-view gait images using an omnidirectional camera. We first transform omnidirectional silhouette images into panoramic ones and obtain a spatio-temporal gait silhouette volume (GSV). Next, we extract frequency-domain features by Fourier analysis based on gait periods estimated by autocorrelation of the GSVs. Because the omnidirectional camera provides a change of observation views, multi-view features can be extracted from parts of GSV corresponding to basis views. In an identification phase, distance between a probe and a gallery feature of the same view is calculated, and then these for all views are integrated for matching. Experiments of gait identification including 15 subjects from 5 views demonstrate the effectiveness of the proposed method.

2010-12-17 18:48:22
1 はてなブックマーク

https://ci.nii.ac.jp/naid/110006345179