Author
大隅 昇
Publisher
日本行動計量学会
Journal
行動計量学 (ISSN:03855481)
Volume, issue, pages, and publication date
vol.29, no.1, pp.20-44, 2002 (Released:2009-04-07)
Number of references
26
Number of citing works
16 8

Rapid development in the survey methods available on the World Wide Web (WWW) is having a major impact on conventional survey data collection methods. A wide range of opinions has given rise to an ongoing debate about the future role of Internet surveys (in particular, Web surveys), centered on the role that self-administration will play in research. We began by arranging a practical procedure for electronic data collection in Web surveys that were experimentally designed from the viewpoint of “data science.” To verify the applicability, possibilities, and limitations of Web survey methods, we conducted three experimental surveys between 1997 and 2000. They were designed to enable comparison with each other and with traditional methods such as face-to-face interviews and online surveys using conventional sampling procedures. These surveys provided informative results about the characteristics of Web surveys. In the first survey, consisting of 12 successive surveys of a single panel of registrants, we examined the relationship between the response rates and the questionnaire's design, volume, and content, as well as response rate differences among the 12 surveys and the discrepancies between repeated surveys. In the second experimental survey, we carried out Web surveys at about the same time on three different sites, together with non-Internet surveys using conventional sampling methods. Our experimental design enabled objective comparison of the surveys by keeping the questionnaire design as nearly identical as possible. These surveys showed that Web survey results are similar to each other while differing distinctly from those of conventional surveys. In the third experimental survey, we simultaneously carried out a series of comparative surveys in order to examine the general characteristics of Web surveys found during the second experimental survey.
Except that two sites were used instead of three, the third experimental survey was carried out in the same way as the second. We confirmed that the characteristics observed in the second survey were evident again. We also found that the way the registrants of the surveys (termed “resources”) were selected, and whether the interval between solicitation and survey was short or long, are factors that influence the answers and the response rate. We also found that the respondents do not necessarily represent the resources. In addition, as an addendum to this paper, we report part of the results of a fourth experimental survey, carried out from 2001 to 2002, and compare them with the findings of the previous three trials. In particular, we analyze the itemized causes of “nonresponse” using datasets obtained by electronically tracking each respondent on the WWW. The fourth survey consists of Web surveys on three separate sites, while the other surveys were based on conventional sampling methods (e.g., face-to-face interviews and mail surveys). While we use the same questionnaire design, content, and duration as in the past surveys, we also attempt to examine how the questionnaire design influenced responses. Through these experimental surveys, we propose, from the concept of “data science,” an appropriate approach to designing a Web survey, evaluating its quality, and avoiding possible risks or perils in its design.
Author
大隅 昇 保田 明夫
Publisher
数理社会学会
Journal
理論と方法 (ISSN:09131442)
Volume, issue, pages, and publication date
vol.19, no.2, pp.135-159, 2004-09-30 (Released:2008-12-22)
Number of references
54
Number of citing works
3

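Here, we first survey the characteristics of text mining (TM), also called text data mining (TDM), and give a comprehensive report on the technical elements and issues involved. Next, we organize the problems that currently arise when TM is applied to the analysis of real data. In particular, regarding its applicability, we touch on the importance of approaching problem solving from the viewpoint of data science, and we also introduce specific TM application software. In addition, through an analysis example based on Web survey data that the authors collected independently, we indicate the key points and caveats of how TM can be used. The example illustrates the quantitative evaluation of qualitative information by combining information obtained from open-ended questions with ordinary closed-ended (multiple-choice) questions. However, this realizes only a small part of TM, and the proper form and goals of TM are not limited to it. On this basis, we offer our personal views on the path TM should take and the directions expected of it.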
Author
柳澤 幸雄 大隅 昇
Publisher
応用統計学会
Journal
応用統計学 (ISSN:02850370)
Volume, issue, pages, and publication date
vol.8, no.2, pp.51-71, 1980-01-20 (Released:2009-06-12)
Number of references
7
Number of citing works
2 4

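Among the many clustering techniques, we take up the single linkage and complete linkage methods, which depend only on connectivity or on the rank order of distances. We prepare several criterion measures that gauge the relationship between technique and clusters from the standpoint of "clumpiness," and address the problem of evaluating the number of clusters. Furthermore, we introduce the idea of "sensitivity analysis" as a way of using these measures to infer the properties of a technique, or the clustering process of each technique. Using artificial data with a clearly defined configuration, we examine the utility of sensitivity analysis concretely through experiments. Through these experiments, we show that sensitivity analysis is effective for examining the properties and stability of a technique and the clumpiness of the data.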
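For readers unfamiliar with the two techniques named in this abstract, the following is a minimal sketch of how single linkage and complete linkage differ. It is not the authors' procedure and does not implement their criterion measures or sensitivity analysis; the function name `agglomerate` and the toy data are assumptions made purely for illustration.

```python
from itertools import combinations
from math import dist


def agglomerate(points, n_clusters, linkage="single"):
    """Naive agglomerative clustering (illustrative helper, not from the paper).

    Returns a sorted list of clusters, each a sorted list of point indices.
    """
    # Single linkage scores a pair of clusters by its closest members,
    # complete linkage by its farthest members.
    pick = min if linkage == "single" else max
    clusters = [[i] for i in range(len(points))]
    while len(clusters) > n_clusters:
        # Find the pair of clusters with the smallest linkage distance.
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda ab: pick(
                dist(points[i], points[j])
                for i in clusters[ab[0]]
                for j in clusters[ab[1]]
            ),
        )
        # Merge cluster b into cluster a (a < b, so deleting b is safe).
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return sorted(sorted(c) for c in clusters)


# Hypothetical toy data: points on a line with slowly growing gaps.
points = [(0.0, 0.0), (1.0, 0.0), (2.1, 0.0), (3.3, 0.0), (4.6, 0.0), (6.0, 0.0)]
single_result = agglomerate(points, 2, "single")
complete_result = agglomerate(points, 2, "complete")
print(single_result)    # [[0, 1, 2, 3, 4], [5]]
print(complete_result)  # [[0, 1, 2, 3], [4, 5]]
```

On this toy chain, single linkage keeps absorbing the nearest neighbor and isolates only the last point, while complete linkage produces more compact groups. This is the kind of technique-dependent behavior that the abstract proposes to examine.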
Author
大隅 昇
Publisher
日本行動計量学会
Journal
行動計量学 (ISSN:03855481)
Volume, issue, pages, and publication date
vol.29, no.1, pp.20-44, 2002-03-29
Number of citing works
6 8
