Thanks for your patience during our recent outage at scalar.usc.edu. While Scalar content is loading normally now, saving is still slow, and Scalar's 'additional metadata' features have been disabled, which may interfere with features like timelines and maps that depend on metadata. This also means that saving a page or media item will remove its additional metadata. If this occurs, you can use the 'All versions' link at the bottom of the page to restore the earlier version. We are continuing to troubleshoot, and will provide further updates as needed. Note that this only affects Scalar projects at scalar.usc.edu, and not those hosted elsewhere.
DHRI@A-State
Main Menu
About
DHRI@A-State
Curriculum
Learn Skills, Share Resources
Community
Contact, Events
The Faculty Center
Encoding
1 2019-02-17T18:13:11-08:00 Andrea Davis e50475e163fb87bc8bd10c6c0244468fd91e8da5 31636 1 Example of encoding Dublin Core Metadata plain 2019-02-17T18:13:12-08:00 Andrea Davis e50475e163fb87bc8bd10c6c0244468fd91e8da5This page is referenced by:
-
1
media/1505283b821541de196157757c467ead.jpg
2019-01-04T13:05:44-08:00
Constructing Data
301
plain
2019-02-19T14:54:54-08:00
As a model of knowledge, datasets not only represent information for the purposes of study. They also embody a range of ideas and practices related to the broader politics of representation, inclusion, and access. Therefore, as Trevor Owens of the Library of Congress explains, constructing a dataset requires careful "choices about what and how to collect and how to encode the information" [1].
Collecting
Roopika Risam (on reproducing colonial knowledge and also on archive that contextualize colonial knowledge- think of Vanishing Race)
Participatory and Community Based archive- example Lesbian Herstory Archives pg 140 Intersectional Feminism
Posner radical new digital humanities (shifts focus from mainstream to marginalized but also develops ways of representing people's lives in data "as they have been experienced, not as they have been captured and advanced by businesses and governments".Collection Activity
Encoding
For digital collections, item information is frequently encoded in relational databases. A relational database is a structured set of data that contains a series of formally described tables from which data can be queried or organized in different ways. This structure enhances searching, browsing, and exhibition functions by enabling users to access items and reassemble collections based on tabular information.
To create a collection with these functions, you can either build a relational database or use a content management system (CMS) that functions as a relational database [2].
Either way, before getting started you will want to develop a metadata schema for your collection. Metadata, or data about data, "provides a means of indexing, accessing, preserving, and discovering digital resources" [3]. While metadata can serve different functions related to these tasks, in this workshop we will be primarily concerned with its descriptive function [4]. Specifically, you will learn how to encode individual items with descriptive metadata and develop a schema to ensure that all of the items in your collection are described consistently to facilitate human and computational analysis.
To create your metadata schema, it is generally advisable to build upon established ontologies and/or controlled vocabularies. Not only will this help you structure your data, but, ultimately, it will allow you to integrate your collection into Linked Data initiatives. For the purposes of this workshop, we will build our schemas on Dublin Core (DC), which is one of the most popular ontologies for describing digital resources. Dublin Core consists of fifteen elemental terms (in italics below) and a series of qualified terms that extend or refine the original fifteen elements. Many of these terms, in turn, can be linked to controlled vocabularies, such as those developed by the Library of Congress and the Getty Institute.
Before building a metadata schema based on Dublin Core, it is worth drawing attention to DC alternatives and critiques. While Dublin Core is explicit in its cross-disciplinary aim to transcend the " boundaries of information silos on the Web and within intranets," you may have reasons to use a more specialized ontology, such as the VRA Core for visual culture, the PBCore for audiovisual content, and the IPTC for media resources [5]. Alternatively, you may have reasons to revise or eschew established ontologies and vocabularies. Whether rooted in efforts to 'decolonizing knowledge' or "represent uncertainty, historical contingency, conflict, variation, instability, and multidimensionality," the stakes in today's networked knowledge society can be high [6]. As Michelle Schwartz and Constance Crompton write about their decision to reject "existing data models and international standards" in Lesbian and Gay Liberation in Canada (an interactive digital resource for the study of lesbian, gay, bisexual, and transgender history in Canada from 1964 to 1981) "in an age where we recognize the right to be forgotten, we must also weigh the danger... of hiding our history" [7].Metadata Schema Activity
In this activity you will develop a metadata schema for your collection based on Dublin Core. As the previous discussion should make clear, the objective of this activity is to practice using metadata to structure a collection, not to convince you that all collections should be encoded with Dublin Core.
To create your schema: consult the Dublin Core Elements and Qualifiers manuals; note that each term is optional and can be repeated; and keep in mind that many DC terms can be paired with controlled vocabularies. Your schema should include your selected DC terms with collection-specific directions and examples for each (along the lines of the above example).