Linked Open Data & Metadata
-
Carlo Meghini
Abstract
This article considers linked data, starting with the four rules drawn up in 2006 by the inventor of the web, Tim Berners-Lee, to produce this kind of data: (1) to use a web standard, the Internationalized Resource Identifier (IRI), to name things within the data; and, in particular, (2) to use IRIs of the HTTP protocol, so that data associated with these IRIs can be retrieved and accessed in exactly the same way that web pages are retrieved and accessed; (3) to use a second web standard, the Resource Description Framework (RDF), to format the data, and a third web standard, the SPARQL Query Language, to query those data. Finally, (4) to use IRIs from other datasets within the data, so as to connect one’s own data with those of other people. This article comments on these rules and discusses their implications, highlighting in particular the fact that they lay the bases for the creation of the Semantic Web, that is, a new web, parallel to the web for humans as we have known it for the last two decades. The Semantic Web is made up of pages containing formal knowledge expressed as linked data. This knowledge is consumed by artificial agents carrying out trivial, time-consuming, and error-prone tasks (such as counting the occurrences of a certain syntactic construct in Dante’s Comedia), freeing humans from such tasks and letting them use their time for more intellectual activities (such as figuring out the evolution of Dante’s culture). The vision of the Semantic Web is presented along with two basic ingredients for its establishment: the Resource Description Framework (RDF), the language for expressing linked data, and ontologies - that is, vocabularies - that axiomatize the definitions of the terms used in linked data. For the realization of the Semantic Web, RDF is necessary but not sufficient, because RDF provides the mere structure of linked data, without indicating any particular way to represent a specific domain. This is the role of ontologies, without which any linked data dataset would remain confined within a (possibly very small) community, defeating the vision of a common, global data space. Finally, the article discusses the role of the Semantic Web for the scholarly domain. In fact, linked open data and ontologies play a very important role in the scientific and scholarly world by offering tools for the creation and sharing of data and vocabularies. The key concept here is interdisciplinarity. It has been long recognized that significant progress can be achieved in all branches of science in research projects that are able to combine tools, data, and knowledge from different domains. Research infrastructures such as D4Science are complex systems that allow users to realize interdisciplinarity in science by offering scientists virtual research environments where they can find the tools, data, and knowledge that they need for their work. They also provide them with the communication and collaboration facilities that are necessary to cooperate with their colleagues. D4Sceince is also supporting the humanities with virtual research environments like those of the PARTHENOS and ARIADNE infrastructural projects.
Abstract
This article considers linked data, starting with the four rules drawn up in 2006 by the inventor of the web, Tim Berners-Lee, to produce this kind of data: (1) to use a web standard, the Internationalized Resource Identifier (IRI), to name things within the data; and, in particular, (2) to use IRIs of the HTTP protocol, so that data associated with these IRIs can be retrieved and accessed in exactly the same way that web pages are retrieved and accessed; (3) to use a second web standard, the Resource Description Framework (RDF), to format the data, and a third web standard, the SPARQL Query Language, to query those data. Finally, (4) to use IRIs from other datasets within the data, so as to connect one’s own data with those of other people. This article comments on these rules and discusses their implications, highlighting in particular the fact that they lay the bases for the creation of the Semantic Web, that is, a new web, parallel to the web for humans as we have known it for the last two decades. The Semantic Web is made up of pages containing formal knowledge expressed as linked data. This knowledge is consumed by artificial agents carrying out trivial, time-consuming, and error-prone tasks (such as counting the occurrences of a certain syntactic construct in Dante’s Comedia), freeing humans from such tasks and letting them use their time for more intellectual activities (such as figuring out the evolution of Dante’s culture). The vision of the Semantic Web is presented along with two basic ingredients for its establishment: the Resource Description Framework (RDF), the language for expressing linked data, and ontologies - that is, vocabularies - that axiomatize the definitions of the terms used in linked data. For the realization of the Semantic Web, RDF is necessary but not sufficient, because RDF provides the mere structure of linked data, without indicating any particular way to represent a specific domain. This is the role of ontologies, without which any linked data dataset would remain confined within a (possibly very small) community, defeating the vision of a common, global data space. Finally, the article discusses the role of the Semantic Web for the scholarly domain. In fact, linked open data and ontologies play a very important role in the scientific and scholarly world by offering tools for the creation and sharing of data and vocabularies. The key concept here is interdisciplinarity. It has been long recognized that significant progress can be achieved in all branches of science in research projects that are able to combine tools, data, and knowledge from different domains. Research infrastructures such as D4Science are complex systems that allow users to realize interdisciplinarity in science by offering scientists virtual research environments where they can find the tools, data, and knowledge that they need for their work. They also provide them with the communication and collaboration facilities that are necessary to cooperate with their colleagues. D4Sceince is also supporting the humanities with virtual research environments like those of the PARTHENOS and ARIADNE infrastructural projects.
Chapters in this book
- Frontmatter I
- Contents V
- Introduction 1
-
Part 1: Historiography
- The Historiographical Foundations of Digital Public History 17
- Crowdsourcing and User Generated Content: The Raison d’Être of Digital Public History 35
- Sharing Authority in Online Collaborative Public History Practices 49
- Shifting the Balance of Power: Oral History and Public History in the Digital Era 61
- Digital Public Archaeology 77
- Identities – a historical look at online memory and identity issues 87
- Digital Environmental Humanities 97
- Combining Values of Museums and Digital Culture in Digital Public History 107
- Open Access: an opportunity to redesign scholarly communication in history 121
- Past and Present in Digital Public History 131
- Digital Hermeneutics: The Reflexive Turn in Digital Public History? 139
-
Part 2: Contexts
- Archivists as Peers in Digital Public History 149
- History Museums: Enhancing Audience Engagement through Digital Technologies 165
- Interactive Museum & Exhibitions in Digital Public History Projects and Practices: An Overview and the Unusual Case of M9 Museum 175
- Digital Public History in Libraries 185
- Publishing Public History in the Digital Age 199
- “Learning Public History by doing Public History” 211
- Spaces: What’s at Stake in Their Digital Public Histories? 223
- Digital Public History in the United States 235
- Technology and Historic Preservation: Documentation and Storytelling 243
- Social Media: Snapshots in Public History 259
-
Part 3: Best Practices
- Curation: Toward a New Ethic of Digital Public History 277
- Data Visualization for History 291
- Mapping and Maps in Digital and Public History 301
- Gaming and Digital Public History 309
- Individuals in the Crowd: Privacy, Online Participatory Curation, and the Public Historian as Private Citizen 317
- Building Communities, Reconciling Histories: Can We Make a More Honest History? 327
- Cybermemorials: Remembrance and Places of Memory in the Digital Age 337
- Living History: Performing the Past 349
- Activist Digital Public History 359
- Digital Public History: Family History and Genealogy 369
- Digital Personal Memories: The Archiving of the Self and Public History 377
- Planning with the Public: How to Co-develop Digital Public History Projects? 385
- As Seen through Smartphones: An Evolution of Historic Information Embedment 395
-
Part 4: Technology, Media, Data and Metadata
- What does it Meme? Public History in the Internet Memes Era 405
- Historical GIS 419
- Content Management 431
- Linked Open Data & Metadata 439
- Big Data and Public History 447
- Modeling Data Complexity in Public History and Cultural Heritage 459
- History and Video Games 475
- Historians as Digital Storytellers: The Digital Shift in Narrative Practices for Public Historians 485
- The Audiovisual Dimension & the Digital Turn in Public History Practices 495
- Digital Public History and Photography 505
- Exploring Large-Scale Digital Archives – Opportunities and Limits to Use Unsupervised Machine Learning for the Extraction of Semantics 517
- Infographics and Public History 531
- List of Contributors 545
Chapters in this book
- Frontmatter I
- Contents V
- Introduction 1
-
Part 1: Historiography
- The Historiographical Foundations of Digital Public History 17
- Crowdsourcing and User Generated Content: The Raison d’Être of Digital Public History 35
- Sharing Authority in Online Collaborative Public History Practices 49
- Shifting the Balance of Power: Oral History and Public History in the Digital Era 61
- Digital Public Archaeology 77
- Identities – a historical look at online memory and identity issues 87
- Digital Environmental Humanities 97
- Combining Values of Museums and Digital Culture in Digital Public History 107
- Open Access: an opportunity to redesign scholarly communication in history 121
- Past and Present in Digital Public History 131
- Digital Hermeneutics: The Reflexive Turn in Digital Public History? 139
-
Part 2: Contexts
- Archivists as Peers in Digital Public History 149
- History Museums: Enhancing Audience Engagement through Digital Technologies 165
- Interactive Museum & Exhibitions in Digital Public History Projects and Practices: An Overview and the Unusual Case of M9 Museum 175
- Digital Public History in Libraries 185
- Publishing Public History in the Digital Age 199
- “Learning Public History by doing Public History” 211
- Spaces: What’s at Stake in Their Digital Public Histories? 223
- Digital Public History in the United States 235
- Technology and Historic Preservation: Documentation and Storytelling 243
- Social Media: Snapshots in Public History 259
-
Part 3: Best Practices
- Curation: Toward a New Ethic of Digital Public History 277
- Data Visualization for History 291
- Mapping and Maps in Digital and Public History 301
- Gaming and Digital Public History 309
- Individuals in the Crowd: Privacy, Online Participatory Curation, and the Public Historian as Private Citizen 317
- Building Communities, Reconciling Histories: Can We Make a More Honest History? 327
- Cybermemorials: Remembrance and Places of Memory in the Digital Age 337
- Living History: Performing the Past 349
- Activist Digital Public History 359
- Digital Public History: Family History and Genealogy 369
- Digital Personal Memories: The Archiving of the Self and Public History 377
- Planning with the Public: How to Co-develop Digital Public History Projects? 385
- As Seen through Smartphones: An Evolution of Historic Information Embedment 395
-
Part 4: Technology, Media, Data and Metadata
- What does it Meme? Public History in the Internet Memes Era 405
- Historical GIS 419
- Content Management 431
- Linked Open Data & Metadata 439
- Big Data and Public History 447
- Modeling Data Complexity in Public History and Cultural Heritage 459
- History and Video Games 475
- Historians as Digital Storytellers: The Digital Shift in Narrative Practices for Public Historians 485
- The Audiovisual Dimension & the Digital Turn in Public History Practices 495
- Digital Public History and Photography 505
- Exploring Large-Scale Digital Archives – Opportunities and Limits to Use Unsupervised Machine Learning for the Extraction of Semantics 517
- Infographics and Public History 531
- List of Contributors 545