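Authors
村田 祐菜 永崎 研宣 大向 一輝
Publisher
Japanese Association for Digital Humanities
Journal
Digital Humanities (デジタル・ヒューマニティーズ) (ISSN:21897867)
Volume, issue, pages, publication date
vol.3, no.1, pp.17-26, 2022-12-31 (Released:2022-12-31)
Number of references
27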

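In literary studies and other fields, an environment in which texts of a certain scale can be searched quickly and comprehensively is an important part of the research infrastructure. For modern tanka, however, the electronic texts available for research have not yet been sufficiently accumulated. In addition, scholars of Japanese literature still rarely adopt text processing and analysis methods that rely on programming or similar techniques, so building the texts is not enough: a search and analysis interface is also needed as an environment for using the data. We therefore created electronic texts of modern tanka as research data, built a full-text search system as the environment for using them, and released the result as the 「近代短歌データベース」 (Modern Tanka Database). This paper describes the construction process, the details of the implemented functions, and use cases in modern tanka research.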
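Authors
岡田 一祐
Publisher
Japanese Association for Digital Humanities
Journal
Digital Humanities (デジタル・ヒューマニティーズ) (ISSN:21897867)
Volume, issue, pages, publication date
vol.2, pp.26, 2020-11-20 (Released:2020-11-20)
Number of references
11
Number of citations
32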

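This paper discusses an efficient TEI-based encoding model for early Japanese dictionaries. Its examples are early dictionaries (dictionaries compiled in Japan before 1615), specifically Chinese-character dictionaries of the Heian period. The structure of early dictionaries is often hard to discern, which makes them difficult to use as sources. We aim to bridge that gap with structural annotation. Encoding these sources with a common model is also expected to make structural differences among them easier to identify. The schema used here is TEI (Text Encoding Initiative), an internationally used set of conventions for text encoding. It accommodates a variety of approaches to encoding, including ones for dictionaries and lexical databases. Because the application of TEI to classical East Asian dictionaries has not yet been sufficiently examined, many points need consideration when encoding. This paper discusses how the various elements found in early dictionaries should be encoded to best support information interchange.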
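As an illustration of what structural annotation of a dictionary entry can look like, the following is a minimal sketch that builds one entry with elements from the TEI dictionaries module (`entry`, `form`, `orth`, `pron`, `sense`, `def`). It does not reproduce the encoding model proposed in the paper; the function name `make_entry` and the sample headword, reading, and gloss are assumptions made up for this example.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
ET.register_namespace("tei", TEI_NS)


def tei(tag: str) -> str:
    """Return the namespace-qualified name of a TEI element."""
    return f"{{{TEI_NS}}}{tag}"


def make_entry(headword: str, reading: str, gloss: str) -> ET.Element:
    """Build a single <entry> (hypothetical helper, not the paper's model)."""
    entry = ET.Element(tei("entry"))
    form = ET.SubElement(entry, tei("form"), {"type": "lemma"})
    ET.SubElement(form, tei("orth")).text = headword   # written form
    ET.SubElement(form, tei("pron")).text = reading    # reading gloss
    sense = ET.SubElement(entry, tei("sense"))
    ET.SubElement(sense, tei("def")).text = gloss      # definition text
    return entry


# Invented sample data, not taken from any actual dictionary.
entry = make_entry("山", "やま", "mountain")
print(ET.tostring(entry, encoding="unicode"))
```

Once entries from different dictionaries share element names like these, the same query (for example `entry.find("tei:form/tei:orth", {"tei": TEI_NS})`) can be run across all of them, which is the kind of cross-source comparison a common model is meant to enable.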
Authors
Katsuya Masuda Makoto Tanji Hideki Mima
Publisher
Japanese Association for Digital Humanities
Journal
Journal of the Japanese Association for Digital Humanities (ISSN:21887276)
Volume, issue, pages, publication date
vol.1, no.1, pp.37-43, 2015-09-02 (Released:2015-09-02)
Number of references
2
Number of citations
2

This study proposes a framework for accessing the modern history of Japanese philosophy using natural language processing (NLP) and visualization. Discovering new knowledge in massive amounts of information requires the support of information technology. To support knowledge discovery across a vast number of books, we developed an OCR-based automatic book-digitizing framework and a system that visualizes documents together with the relationships among them, which are calculated using NLP techniques. We applied the framework to the Japanese journal Shisō (“Thought”), published by Iwanami Shoten. We show an example of a knowledge structure extracted from Shisō using our visualization system.
Authors
Jennifer Edmond Natasa Bulatovic Alexander O'Connor
Publisher
Japanese Association for Digital Humanities
Journal
Journal of the Japanese Association for Digital Humanities (ISSN:21887276)
Volume, issue, pages, publication date
vol.1, no.1, pp.107-122, 2015-09-02 (Released:2015-09-02)
Number of references
17
Number of citations
2

The Collaborative EuropeaN Digital Archival Research Infrastructure (CENDARI) project has developed a new virtual environment for humanities research, reimagining the analogue landscape of research sources for medieval and modern history and humanities research infrastructure models for the digital age. To achieve this, the project has needed to be sensitive to the ways in which historical research practices in the 21st Century are distinct from those of earlier eras, harnessing the affordances of technology to reveal connections and support or refute hypotheses, enabling transnational approaches, and federating sources beyond the well-known and across the largely national organization paradigms that dominate within traditional knowledge infrastructures (libraries, archives and museums). This paper describes both the user-centered development methodology deployed by the project and the resulting technical architecture adopted to meet these challenging requirements. The resulting system is a robust ‘enquiry environment’ able to integrate a variety of data types and standards with bespoke tools for the curation, annotation, communication and validation of historical insight.
Authors
Yui Arakawa Ryosuke Yoshimoto Fuyuki Yoshikane Takafumi Suzuki
Publisher
Japanese Association for Digital Humanities
Journal
Journal of the Japanese Association for Digital Humanities (ISSN:21887276)
Volume, issue, pages, publication date
vol.1, no.1, pp.1-9, 2015-09-02 (Released:2015-09-02)
Number of references
13

Gathering information from social media content is becoming increasingly popular. Twitter, a microblog where posts are limited to 140 characters, is an excellent platform for gathering instant and interactive information. Considerable research has focused on Twitter’s effectiveness for disseminating emergency alerts and confirming the safety of acquaintances. However, there has been less emphasis on the analysis of Twitter posts to obtain information specialized to specific domains. Such analysis could enable simple and rapid identification of information related to state-of-the-art technology. Against this background, this study reports on a preliminary analysis of tweets by Japanese academic researchers. Our content analysis and text analysis reveal that many academic researchers tweet about their individual activities, education, or research. Their tweets contain domain-specific knowledge and have identifiable textual characteristics. This study provides basic findings that can be applied to obtain domain-specific knowledge from Twitter.
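Authors
中村 覚 大和 裕幸 稗方 和夫 満行 泰河
Publisher
Japanese Association for Digital Humanities
Journal
Digital Humanities (デジタル・ヒューマニティーズ) (ISSN:21897867)
Volume, issue, pages, publication date
vol.1, pp.29-43, 2018 (Released:2019-01-18)
Number of references
25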

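In recent years, as digital archives have spread, attempts have been made to support the study of historical sources by integrating digital archives with Semantic Web technologies and by providing multifaceted information in combination with timelines, maps, and similar resources. Such efforts are important because they make research on historical sources more efficient, but they are mainly undertaken by the providers who publish digital archives. No environment has yet been established in which users of historical sources, such as historians, can collect and organize published sources and receive support in analyzing them. In this study, we develop a system that uses Linked Data to link the information published by digital archives that provide a SPARQL Endpoint with the information that individual researchers organize, and that supports source analysis tailored to each researcher's purpose. We also conduct case studies on two different databases to verify the usefulness of the proposed method.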
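The idea of linking archive data with a researcher's own notes through shared URIs can be sketched as follows. This is a minimal illustration, not the system described in the paper: the endpoint URL, the resource URIs, and the helper names `build_query_url` and `link_notes` are hypothetical. The only standard behavior relied on is that a SPARQL endpoint accepts a query through the `query` parameter of a GET request. No network request is made here.

```python
from urllib.parse import urlencode

# Hypothetical endpoint; a real archive would publish its own URL.
ENDPOINT = "https://example.org/sparql"


def build_query(resource_uri: str) -> str:
    """SPARQL query listing every property and value of one resource."""
    return f"SELECT ?p ?o WHERE {{ <{resource_uri}> ?p ?o }} LIMIT 50"


def build_query_url(endpoint: str, resource_uri: str) -> str:
    """Encode the query as a GET request URL for the endpoint."""
    return endpoint + "?" + urlencode({"query": build_query(resource_uri)})


def link_notes(archive_records, notes_by_uri):
    """Attach a researcher's notes to archive records that share a URI."""
    return [
        {**record, "notes": notes_by_uri.get(record["uri"], [])}
        for record in archive_records
    ]


# Invented records standing in for data returned by the archive.
records = [
    {"uri": "https://example.org/item/1", "title": "Letter A"},
    {"uri": "https://example.org/item/2", "title": "Diary B"},
]
# Invented notes kept by an individual researcher, keyed by the same URIs.
notes = {"https://example.org/item/1": ["Compare with Diary B"]}

print(build_query_url(ENDPOINT, records[0]["uri"]))
print(link_notes(records, notes))
```

Because both sides refer to a source by the same URI, the researcher's notes stay in a separate store and can be joined to the archive's data on demand rather than copied into it.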
Authors
Tomohiko Morioka
Publisher
Japanese Association for Digital Humanities
Journal
Journal of the Japanese Association for Digital Humanities (ISSN:21887276)
Volume, issue, pages, publication date
vol.1, no.1, pp.86-106, 2015-09-02 (Released:2015-09-02)
Number of references
11

This paper describes a knowledge-based character processing model that resolves some problems of the coded character model. Currently, in the field of information processing of digital texts, each character is represented and processed by the “Coded Character Model.” In this model, each character is defined and shared using a coded character set (code) and represented by a code point (integer) of the code. In other words, when knowledge about characters is defined (standardized) in the specification of a coded character set, there is no need to store large and detailed knowledge about characters in computers for basic text processing. In terms of flexibility, however, the coded character model has some problems, because it assumes a finite set of characters, with each character of the set having a stable concept shared in the community. Real character usage is not so static and stable. For Chinese characters in particular, it is not easy to select a finite set of characters that covers all usages. To resolve these problems, we have proposed the “Chaon” model, a new model of character processing based on character ontology. This report briefly describes the Chaon model and the CHISE (Character Information Service Environment) project, and focuses on how to represent Chinese characters and their glyphs in the context of multiple unification rules.
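The contrast between the two models can be made concrete with a small sketch. The first part shows the coded character model as Python exposes it: a character is an integer code point. The second part is only a loose illustration of describing characters by features and comparing them under different unification rules; the feature names, the `FEATURES` table, and the `unify` function are assumptions made up for this example and are not the Chaon model's or CHISE's actual data format.

```python
def code_point_label(ch: str) -> str:
    """Return the 'U+XXXX' label for a single character."""
    return f"U+{ord(ch):04X}"


# Coded character model: text is a sequence of integers.
print([code_point_label(ch) for ch in "漢字"])

# Two variant forms of the same ideograph, built from their code points
# so that the example does not depend on how this file is displayed.
VARIANT_A = chr(0x6238)
VARIANT_B = chr(0x6236)

# Hypothetical feature table: the code point is just one feature among others.
FEATURES = {
    VARIANT_A: {"code-point": 0x6238, "abstract-character": "door"},
    VARIANT_B: {"code-point": 0x6236, "abstract-character": "door"},
}


def unify(a: str, b: str, rule: tuple) -> bool:
    """Two characters unify under a rule if they agree on all its features."""
    return all(FEATURES[a][name] == FEATURES[b][name] for name in rule)


print(unify(VARIANT_A, VARIANT_B, ("code-point",)))          # strict rule
print(unify(VARIANT_A, VARIANT_B, ("abstract-character",)))  # looser rule
```

Under the strict rule the two variants are different characters; under the looser rule they are the same. A fixed coded character set has to commit to one such rule, whereas a feature-based description lets the rule be chosen per task.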
Authors
Hajime Murai
Publisher
Japanese Association for Digital Humanities
Journal
Journal of the Japanese Association for Digital Humanities (ISSN:21887276)
Volume, issue, pages, publication date
vol.1, no.1, pp.44-57, 2015-09-02 (Released:2015-09-02)
Number of references
11

To support the automatic semantic analysis of texts in the humanities, it is not sufficient to analyze words and evaluate word pairs, because it is necessary to process larger units, such as phrases, sentences, and paragraphs. This study proposes the introduction of intratextuality into a digital archive system. In the future, this method will be developed as the basis for semantic analysis of larger units. Classical literary structures that are used frequently in the Old and New Testaments were digitized as a case study. A literary structure data format for a relational database was also implemented. The literary structures of 39 books in the Old Testament and 27 books in the New Testament were digitized. The total number of digitized literary structures was 1,507 and the elements of these structures comprised 7,715 pairs. These data were stored in a Java-based relational database system and a web-based viewer program for rhetorical structures was implemented as a JSP servlet. This web-based program will be combined with an existing digital archive system that can manage intertextuality data. The Java-based relational database system and the JSP servlet will facilitate numerical analyses of the intertextuality and intratextuality of digital archive systems of classical texts, thus making it much easier to conduct scientific analyses of the meanings of texts.
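A relational format for literary structures and their paired elements might look like the following minimal sketch. It uses Python's built-in `sqlite3` rather than the Java-based database and JSP servlet mentioned in the abstract, and the table names, column names, and sample rows are hypothetical; the abstract does not give the actual schema. The sample models one concentric structure (A, B, B', A') as two pairs.

```python
import sqlite3

# Hypothetical schema: one row per literary structure, one row per pair
# of corresponding elements inside it.
SCHEMA = """
CREATE TABLE structure (
    id   INTEGER PRIMARY KEY,
    book TEXT NOT NULL,
    span TEXT NOT NULL
);
CREATE TABLE element_pair (
    id           INTEGER PRIMARY KEY,
    structure_id INTEGER NOT NULL REFERENCES structure(id),
    label        TEXT NOT NULL,
    first_span   TEXT NOT NULL,
    second_span  TEXT NOT NULL
);
"""


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory database holding one invented structure."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    cur = conn.execute(
        "INSERT INTO structure (book, span) VALUES (?, ?)",
        ("ExampleBook", "1:1-8"),
    )
    conn.executemany(
        "INSERT INTO element_pair "
        "(structure_id, label, first_span, second_span) VALUES (?, ?, ?, ?)",
        [
            (cur.lastrowid, "A", "1:1-2", "1:7-8"),  # outer pair A / A'
            (cur.lastrowid, "B", "1:3-4", "1:5-6"),  # inner pair B / B'
        ],
    )
    conn.commit()
    return conn


def pairs_per_structure(conn: sqlite3.Connection):
    """Count the element pairs that belong to each structure."""
    return conn.execute(
        "SELECT s.book, s.span, COUNT(p.id) "
        "FROM structure s LEFT JOIN element_pair p ON p.structure_id = s.id "
        "GROUP BY s.id ORDER BY s.id"
    ).fetchall()


print(pairs_per_structure(build_demo_db()))
```

Keeping pairs in their own table is what makes counts such as the totals of structures and pairs reported in the abstract a single aggregate query.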
Authors
James Smithies Paul Millar Chris Thomson
Publisher
Japanese Association for Digital Humanities
Journal
Journal of the Japanese Association for Digital Humanities (ISSN:21887276)
Volume, issue, pages, publication date
vol.1, no.1, pp.10-36, 2015-09-02 (Released:2015-09-02)
Number of references
28
Number of citations
3

The UC CEISMIC Canterbury Earthquakes Digital Archive was established in response to the devastating earthquakes that struck the Canterbury region of New Zealand from September 2010 onwards, including four quakes of magnitude 6 or greater and over 11,000 aftershocks. A total of 185 people died, and significant parts of Christchurch city were either destroyed or have needed to be demolished, resulting in financial losses of an estimated NZ$30 billion. The rebuild is expected to take 10 to 15 years, and the UC CEISMIC archive is designed to accommodate this, acting as a distributed national (and eventually international) repository for digital content produced as a result of the earthquakes. This paper outlines the design principles and architecture of the archive, describing the commitment to open access and open source that allowed the project team to bring together a broad-ranging national consortium composed of leading cultural organizations, which work alongside content providers ranging from individual citizens, government agencies, and community groups to large media companies. Principles common to the digital humanities community were used to bond the broader project team, in an interesting example of scholar-led community engagement. The goal is to provide a model that can be used, either in whole or in part, by future teams in need of similar capability.
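Authors
河瀬 彰宏 髙木 優貴
Publisher
Japanese Association for Digital Humanities
Journal
Digital Humanities (デジタル・ヒューマニティーズ) (ISSN:21897867)
Volume, issue, pages, publication date
vol.2, pp.3, 2020-11-20 (Released:2020-11-20)
Number of references
25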

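Warabeuta, the foundation of traditional Japanese music, are spontaneously arising songs that children create in the course of play. Dōyō are a similar kind of music. Both are familiar to children, but dōyō differ from warabeuta in that they are composed by adults with the explicit intention of appealing to children. In this study, we define the melodic characteristics of warabeuta, which children created by natural instinct, as "childlikeness" (「子どもらしさ」), and quantitatively clarify how the melodies of dōyō express this "childlikeness". We extracted pitch intervals and note values from the melodies of warabeuta, dōyō, and Japanese popular songs of the same period, and carried out a quantitative comparison using them as features. By examining the features that clearly explain the similarities and differences among the three, we found that the "childlikeness" of dōyō melodies is produced by combining interval transitions within a limited range with rhythms that contain leaps.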
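The two kinds of features named in the abstract, pitch intervals and note values, can be extracted from a melody as in the following minimal sketch. The representation of a note as a (MIDI pitch number, duration in beats) pair, the function names, and the sample melody are assumptions made for illustration; the abstract does not specify the paper's encoding or its statistical procedure.

```python
from collections import Counter

# Invented melody: one (MIDI pitch number, duration in beats) pair per note.
DEMO_MELODY = [(67, 1.0), (69, 0.5), (67, 0.5), (64, 1.0), (67, 1.0)]


def pitch_intervals(melody):
    """Signed semitone distance between each pair of consecutive notes."""
    pitches = [pitch for pitch, _ in melody]
    return [later - earlier for earlier, later in zip(pitches, pitches[1:])]


def note_values(melody):
    """Duration of each note, in beats."""
    return [duration for _, duration in melody]


def relative_frequencies(values):
    """Share of each distinct value, usable as a feature vector."""
    counts = Counter(values)
    total = sum(counts.values())
    return {value: count / total for value, count in counts.items()}


print(pitch_intervals(DEMO_MELODY))
print(relative_frequencies(pitch_intervals(DEMO_MELODY)))
print(relative_frequencies(note_values(DEMO_MELODY)))
```

Computing these frequency tables separately for each corpus gives vectors that can be compared directly, for example to check whether one repertoire concentrates its intervals in a narrower range than another.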
Authors
Yusuke Nakamura Chikahiko Suzuki Katsuya Masuda Hideki Mima
Publisher
Japanese Association for Digital Humanities
Journal
Journal of the Japanese Association for Digital Humanities (ISSN:21887276)
Volume, issue, pages, publication date
vol.2, no.1, pp.60-72, 2017-09-06 (Released:2017-09-07)
Number of references
3
Number of citations
1

The present paper aims to design a monitoring framework for a still-new interdisciplinary research and education program in Japan, “Cultural Resources Studies” ("Bunkashigengaku" in Japanese). We analyze the linkage between a university, an academic association, and the practitioners’ institutions closely related to cultural resources by mining the principal texts they have produced. Our findings reveal the complicated relations among these stakeholder institutions and attest to the importance of the revision cycle for the advancement of interdisciplinary studies.