Visions and open challenges for a knowledge-based culturomics
Journal article, 2015

The concept of culturomics was born out of the availability of massive amounts of textual data and the interest in making sense of cultural and language phenomena over time. Thus far, however, culturomics has only made use of, and shown the great potential of, statistical methods. In this paper, we present a vision for a knowledge-based culturomics that complements traditional culturomics. We discuss the possibilities and challenges of combining knowledge-based methods with statistical methods and address major challenges that arise due to the nature of the data: diversity of sources, changes in language over time, and the temporal dynamics of information in general. We address all layers needed for knowledge-based culturomics, from natural language processing and relations to summaries and opinions.

Digital humanities

Statistical analysis

eScience

Culturomics

eInfrastructure

Natural language processing

Temporal text analysis

Knowledge-based analysis

Authors

Nina Tahmasebi

University of Gothenburg

Lars Borin

University of Gothenburg

Gabriele Capannini

Chalmers, Computer Science and Engineering, Computer Engineering

Devdatt Dubhashi

Chalmers, Computer Science and Engineering, Computer Science

Peter Exner

Lund University

Markus Forsberg

University of Gothenburg

Gerhard Gossen

L3S Research Center

Fredrik Johansson

Chalmers, Computer Science and Engineering, Computer Science

Richard Johansson

University of Gothenburg

Mikael Kågebäck

Chalmers, Computer Science and Engineering, Computer Engineering

Olof Mogren

Chalmers, Computer Science and Engineering, Computer Science

Pierre Nugues

Lund University

Thomas Risse

L3S Research Center

International Journal on Digital Libraries

1432-5012 (ISSN) 1432-1300 (eISSN)

Vol. 15, Issue 2-4, pp. 169-187

Areas of Advance

Information and Communication Technology

Subject Categories

Language Technology (Computational Linguistics)

Computer Science

DOI

10.1007/s00799-015-0139-1

More information

Last updated

2023-02-22