As the curtain rises on this course, several class meetings remain unstructured. This (hopefully) will allow us to take advantage of a few different kinds of opportunities: topics identified during our discussion, guest speakers, and topics that simply need more time for discussion.
Here's a gcal view of the course, in case it's helpful:
Class | Topic | Activities |
---|---|---|
Class 1
Aug 27 |
Introductions, Two Cultures |
Welcome to class.... Today we'll introduce ourselves and and talk about where this class should take us. Be prepared to talk a bit about the research questions you're asking these days, and how they might relate to this class. (Research questions like, "What should I be researching?" are perfectly admissible.) Also, we'll talk about situating ourselves and this class in the context of C.P. Snow's famous "two cultures" lecture. To that end, please read the essay version of his lecture (the link provides both HTML and PDF versions), and reflect on how well the "two cultures" reflects--or doesn't--your scholarly background and aspirations. How are Snow's beliefs and concerns relevant to the contemporary practices of the digital humanities? How are they related to Taylor's slightly more recent perspective?
Fill out this very short survey about your interests and experience. Make sure you've received my email about the CUNY Academic Commons and that you've joined the course group. We'll discuss how to use the Commons for this course. |
Class 2
Sept 3 |
Computing and Humanities:
Fragments of History |
Talking about "computational problems in the digital humanities " has a (perhaps) surprisingly long history and complex context. Hockey provides a fairly detailed early history, focusing on text analysis; McCarty provides a more personal long-term perspective. Svensson and Kirschenbaum give more recent "macro level" histories. How is computing (in general) and computer science (as a 'science' or discipline) represented in these histories?
We'll formulate research groups today. |
Class 3
Sept 17 |
Images, Vision |
Images and their analysis are also the subject of DH inquiry, though this is perhaps a newer flavor of inquiry than text-based inquiry is. The first two papers (and the supporting survey data) describe an interesting proto-collaboration between computer scientists and art historians. Saleh et al describe a computational approach to making claims of influence between artists, based on computational analysis of images of paintings and machine learning applied to features derived from that analysis. Responding to that paper and its popular reception, Spratt and Elgammal reflect on the significance and possibility of deeper collaboration between art historians and computer scientists Chung et al and Toler-Franklin et al are fairly technical treatments of how computer vision techniques have been applied to two humanistic problems. Chung et al analyze and classify woodblock prints from a large corpus of ballads held at the Bodleian Library; Toler-Franklin et al apply computer vision and machine learning techniques to help re-assemble shattered classical frescoes.
We'll check in about the groups' research topics, and we'll spend a little time collectively formulating the structure for the "mini-presentations" we'll do in the last few class meetings. |
Class 4
Sept 24 |
A Model Topic: Topic Modeling |
So let's look at an actual computational problem: topic modeling. Today's readings illustrate a relatively gentle computer science perspective (in an overview article published in Communications of the ACM) as well as a couple DH-flavored applications, in articles co/authored by David Mimno, a CS/IS professor at Cornell and currently the guardian of the MALLET natural language processing toolkit. Read them in any order you like, paying attention to how computer science and the humanities each manifest in these articles.
We'll talk about everyone's mini-session topics, and develop a common structure for those sessions (which I'll post). I'll also propose a schedule for the mini-sessions, one that aims to cluster related sessions into one meeting. If that's possible. |
Class 5
Oct 1 |
Text Encoding |
As the histories we read a couple weeks ago suggest, text is a central focus of many digital humanists. One major problem is the digital encoding of texts (which can be exceptionally complicated things) in a way that best supports their study and dissemination. This week, we'll focus on the biggest player in humanities text encoding: the Text Encoding Initiative (TEI). Much technical and political effort has gone into producing a text encoding standard for the humanities that seems likely to stand the test of time; "TEI: History" gives a brief history of the development of the TEI, while Renear gives a longer (and somewhat out of date, ending c. 2004) history of the problem of text encoding. Text encoding mainly from concerns about the dissemination and preservation of text, but it also can be used to support analyses that would be extraordinarily difficult to perform on "plain" text. "Text Analysis and the TEI" is an exercise in analyzing heavily-encoded text (in this case, Romeo and Juliet); do the exercise and spend some time playing with the possibilities. Duhaime writes of a larger exploration of the corpus of Shakespeare's plays. What kinds of questions/problems could you imagine exploring with this sort of encoded text?
If youu knowledge of XML feels rusty, you may also want to look at the TEI's Gentle Introduction to XML. We'll share group topics and check-in on project status. |
Class 6
Oct 8 |
Music and Computing |
Musicology is embracing digital methods, too; in many ways, the digital representation, manipulation, and analysis of music offers even greater challenges than similar work with text. Papdopoulos gives a (slightly out-of-date) survey of techniques for algorithmically generating musical works. Pugin offers a perspective on current problems in digital musicology. The Music Encoding Initiative provides a history and introduction to some of the challenges of encoding music in rough analogy with the Text Encoding Initiative's work. Pugin et al (Verovio) and meiView illustrate some applications of the MEI. Randolph introduces a completely different sort of computational music challenge.
|
Class 7 Oct 15 |
Access |
One of the digital humanities' general imperatives is improving our access both to archival materials (eg by digitizing texts and publishing them online) and to scholar materials (eg by publishing in "open access" fora, or by using and producing "open source" code). Spiro argues this impulse is, in part, a defining one for DH. But are these forms of openness enough? Making sure people of all abilities have access to digital resources is a computational problem that goes well beyond the digital humanities but certainly affects them. How does "data visualization," for example, work for people whose vision is outside 'normal' parameters? Could data visualizations be translated into data visceralizations? Williams suggests that DH projects should hew to principles of "universal design," and reports on one project aiming to "make the digital humanities more open." Heron offers an example of how to think through accessibility in the context of a reasonably complex text-based application game.
|
Class 8 Oct 22 |
Copyright, Property, Rights, Freedom | As James pointed out, our ability to study things can be limited by our right, or lack thereof, to study them. Which is to say, some limitations on access are not rooted in questions of ability but in political/economic/legal systems which can erect or eradicate barriers to access. (Such as the occasional reading for this class that requires access throuh the CUNY libraries.) It's amazingly difficult to find readings on this topic, either for academic study in general or for DH in particular. But for starters...
As you read these, be thinking about some broader issues (which these readings don't necessarily touch directly) about computing, access, and DH. How can (should?) the DH community agitate for more open access to data? (Where this engages copyright and DRM but also proprietary restrictions like corporate APIs). To what extent should we accept the "information landscape" with its barriers and blockages, and how much should we be working to change it? What can DH scholars do to engage these challenges? Very last-minute addition (perhaps more for posterity): a paper on the implications of DMCA for the practice of science (and, by extention, I think, software-mediated research: Anticircumvention Rules: Threat to Science, Pamela Samuelson, Science v. 293, 2001, 2028–2031. |
Class 9 Oct 29 |
Mini-Sessions: Chris/ICR; Martin/Image Processing | Sebasitainao, Impedovo, "More than twenty years of advancements on Frontiers in handwriting recognition," Pattern Recognition 47(3), March 2014, 916–928. |
Class 10 Nov 5 |
Mini-Sessions: Xiuyan/Music Generation; Cong/LDA | Xiuyan assigns us:
|
Class 11 Nov 12 |
Project Work | |
Class 12 Nov 19 |
Mini-Sessions: James/Sentiment Analysis |
|
Class 13 Dec 3 |
||
Class 14 Dec 10 |
Guest Speakers. Wrap-up and solution of the world's problems. | Topics/Readings TBD |
Finals Day Dec 17 |
Project Presentations |