Since 2019, experts at the University of North Carolina at Chapel Hills University Libraries have investigated the use of machine learning to identify racist laws from North Carolinas past. Now a grant of $400,000 from The Andrew W. Mellon Foundation will allow them to extend that work to two more states. The grant will also fund research and teaching fellowships for scholars interested in using the projects outputs and techniques.
On the Books: Jim Crow and Algorithms of Resistance began with a question from a North Carolina social studies teacher: Was there a comprehensive list of all the Jim Crow laws that had ever been passed in the state?
Finding little beyond scholar and activist Pauli Murrays 1951 book States laws on race and color, a team of librarians, technologists and data experts set out to fill the gap. The group created machine-readable versions of all North Carolina statutes from 1866 to 1967. Then, with subject expertise from scholarly partners, they trained an algorithm to identify racist language in the laws.
We identified so many laws, said Amanda Henley, principal investigator for On the Books and head of digital research services at the University Libraries. There are laws that initiated segregation, which led to the creation of additional laws to maintain and administer the segregation. Many of the laws were about school segregation. Other topics included indigenous populations, taxes, health care and elections, Henley said. The model eventually uncovered nearly 2,000 North Carolina laws that could be classified as Jim Crow.
Henley said that On the Books is an example of collections as datadigitized library collections formatted specifically for computational research. In this way, they serve as rich sources of data for innovative research.
The next phase of On the Books will leverage the teams learnings through two activities:
Weve gained a tremendous amount of knowledge through this project everything from how to prepare data sets for this kind of analysis, to training computers to distinguish between Jim Crow and not Jim Crow, to creating educational modules so others can use these findings. Were eager to share what weve learned and help others build upon it, said Henley.
On the Books began in 2019 as part of the national Collections as Data: Part to Whole project, funded by The Andrew W. Mellon Foundation. Subsequent funding from the ARL Venture Fund and from the University Libraries internal IDEA Action grants allowed the work to continue. The newest grant from The Mellon Foundation will conclude at the end of 2023.