Natural Language Processing and Social Interaction, Spring 2024

Main

More and more of life is now manifested online, and many of the digital traces that are left by human activity are increasingly recorded in natural-language format. This research-oriented course examines the opportunities for natural language processing to contribute to the analysis and facilitation of socially embedded processes. Possible topics include conversation modeling, analysis of group and sub-group language, language and social relations, persuasion and other causal effects of language.

Click on the tabs just above to see information about enrollment/prerequisite policies, administrative info, overall course structure, resources, and so on.

Enrollment, prerequisites, related classes

Enrollment: Limited to [[PhD and [CS MS] students] who meet the prerequisites]; PhD students not in CS/INFO will receive manual instructor permission to enroll (details to be arranged at lecture). Auditing (either officially or unofficially) is not permitted. These policies are meant to keep class meetings heavily discussion-focused.

Prerequisites: All of the following: (1) CS 2110 or equivalent programming experience; (2) a course in artificial intelligence or any relevant subfield (e.g., NLP, information retrieval, machine learning, Cornell CS courses numbered 47xx or 67xx); (3) proficiency with using machine learning tools (e.g., fluency at training an SVM or other classifier, comfort with assessing a classifier’s performance using cross-validation).

Please take a look at the contents of some of the papers on this quick list of sample papers (URLs should be clickable) before deciding on enrollment; if most of them seem completely impenetrable (or uninteresting), this class may not be the right fit for you.

Related classes: see Cornell's NLP course list.

In particular, Spring 2024 courses CS 6741 Topics in natural language processing and machine learning, CS 5740 Natural language processing (Cornell Tech students only), INFO 4940-LEC 006 Advanced NLP for Humanities Research, CS 4744 (and other crosslists) Computational linguistics I, or CS/IS 4300 Language and information may be a better choice for you; they are excellent courses for sure!

Other classes I am less knowledgeable about: SOC 6520 Culture wars in the age of tribal politics, GOVT 3282 Data science applications in political and social research.

The webpage from the last time I (Prof. Lee) taught this class may be useful, as might the webpage from the last time I taught a graduate NLP course.

Links, office hours

Websites

Ed Discussions page (access restricted to enrolled students). Course announcements and Q&A/discussion site. Social interaction and all that, you know.
Perusall
- Access via first passing through Canvas: use if not already logged in to Canvas.
- Direct link. Works if you are already logged in to Canvas, or if you have set up access from outside Cornell Canvas.
Zoom. Only accessible to enrolled students, and only meant for cases of illness, travel, and emergency. Notify the instructor ahead of time for each lecture you need to zoom-attend.
Lecture recordings site. Only accessible to enrolled students.
CMS. Site for submitting assignments, unless otherwise noted. Login with NetID credentials and select course CS 6742. You may find this graphically-oriented guide to common operations useful: see how to replace a prior submission; how to tell if CMS successfully received your files; how to form a group.

Office hours and contact info

See Prof. Lee's homepage and scroll to the section on Contact and availability info.

Coursework, policies (that aren't enrollment-related)

Coursework

In-class presentations (exact number depends on number of students enrolled and the difficulty of the papers we tackle). These may involve meeting with the instructor beforehand.

For days when another student is presenting: all non-presenting students are expected to prepare for class by at least skimming the abstract and intro of the paper(s) to be presented.

Participation in discussion, either during class meetings or offline
Occasional small exercises of lecture material
Midterm paper that reviews and critically analyzes the class material. Full instructions.
Final paper that reviews and critically analyzes the class material.

Policies

Use of AI generation/editing systems: For each component of the workload, the vast majority of the intellectual work must be originated by you, not by text generation systems. It is OK to use aids for writing fluency --- but note that writing fluency is not part of the assessment rubrics above anyway.

1. Example of something that is allowed: you write the initial draft(s), review their contents, and double-check them against the original paper. You then use some form of text generation system to proofread and improve the flow. You do not use the system’s output to add extra content.
2. Example of something that is definitely not allowed: you essentially use a text generation system to generate an early draft, even if you later post-edit and correct the output.
3. Example of something that is OK but requires special treatment: you start with the procedure in point 1. But the system output includes good points that you hadn’t thought of before, or makes you realize that a point you had made isn’t quite right.
- You may include the new material and/or make appropriate edits, but you should mention what specific system(s) you used and what changes you made based on it.

Attendance: Please attend all class meetings in person that you are reasonably able to. If in-person attendance isn’t a reasonable option for a given class meeting, please contact the instructor ahead of time.
- Illness is always a valid reason to not attend and is not held against participation accounting, but please let me know that illness is the issue.
- Zoom attendance is available, but is only accessible to enrolled students, and only meant for cases of illness, travel, and emergency. Notify the instructor ahead of time for each lecture you need to zoom-attend.
Deadlines: We do not have slip days, and there is no "you can submit late for a small penalty": you need to hit the deadlines. But if there are extenuating circumstances, please email the instructor and we can talk. (Still submit what you have before the deadline, so we have an indication of your progress at that point.)
SDS accommodations: The instructor(s) have online access to SDS letters regarding accommodations for exams and other course matters, and will honor these accommodations. As recommended by the SDS office, we do ask that for each deadline, you let the instructor know beforehand in a timely fashion whether you wish to apply your accommodations.
Academic integrity
Claiming the work of others as your own is intellectual fraud and a violation of academic integrity. To avoid this, always track and credit your sources appropriately.

Each student in this course is expected to abide by the Cornell University Code of Academic Integrity. The Dean of the Faculty’s page has more information on the Code and related procedures: https://theuniversityfaculty.cornell.edu/dean/academic-integrity/

Lectures


#1 Jan 23: Introduction

visualization of keep/delete comments in temporal order

notabilia.net

Lecture

Slides; recording (only available to enrolled students)

Lecture references and further reading

Danescu-Niculescu-Mizil, Cristian, Robert West, Dan Jurafsky, Jure Leskovec, and Christopher Potts. 2013. No country for old members: User lifecycle and linguistic change in online communities. WWW, pp. 307--318. Best paper award. [ACM link] [paper "homepage"]
Moritz Stefaner, Dario Taraborelli, Giovanni Luca Ciampaglia. 2011. Notabilia – Visualizing Deletion Discussions on Wikipedia
Justine Zhang, Jonathan Chang, Cristian Danescu-Niculescu-Mizil, Lucas Dixon, Yiqing Hua, Dario Taraborelli, Nithum Thain. 2018. Conversations Gone Awry: Detecting Early Signs of Conversational Failure. ACL: 1350--1361.
Ziems, Caleb, William Held, Omar Shaikh, Jiaao Chen, Zhehao Zhang, and Diyi Yang. 2023. Can Large Language Models Transform Computational Social Science? Computational Linguistics, December, 1–53.

#2 Jan 25: Getting to know each other; easing into paper readings

Assignments/announcements

Annotation of the "No country" paper due on Perusall by midnight Wed Jan 31. See slides for details.

Lecture

Slides (pptx); recording (only available to enrolled students)

Lecture references and further reading

Some arguably lesser-known social-interaction sites: All Football (Chinese: 懂球帝), also known as Dongqiudi; BeReal.; Telegram; Xiaohongshu (Chinese: 小红书; pinyin: xiǎohóngshū; lit. 'Little Red Book'); Zhibo Daihuo
Danescu-Niculescu-Mizil, Cristian, Robert West, Dan Jurafsky, Jure Leskovec, and Christopher Potts. 2013. No country for old members: User lifecycle and linguistic change in online communities. WWW, pp. 307--318. Best paper award. [ACM link] [paper "homepage"]

#3 Jan 30: Exploring differences between two language samples: "Fightin' Words"

Cat and Girl

Lecture

Slides, recording (only available to enrolled students)

Lecture references and further reading

Denny, Matt. 2016. Revisiting Fightin’ Words: Feature Selection Using an Informed Dirichlet Model.
Kleinberg, Jon. 2016. Temporal Dynamics of On-Line Information Streams. In Data Stream Management: Processing High-Speed Data Streams, 221–38. [DOI]
Liberman, Mark. Feb 12, 2022. The mystery of the decay. Language Log blog post.
Liberman, Mark. 2023. Debate words (Fox News Republican presidential debate). Language Log blog post, which also links to his previous analyses of other data using Monroe et al.'s technique.
Kawintiranon, Kornraphop, and Lisa Singh. 2021. Knowledge Enhanced Masked Language Model for Stance Detection. NAACL, 4725–35.
Monroe, Burt L., Michael P. Colaresi, and Kevin M. Quinn. 2008. Fightin' words: Lexical feature selection and evaluation for identifying the content of political conflict. Political Analysis 16(4): 372-403. [alt link]

Implementations

ConvoKit implementation, based on prior code from Jack Hessel's implementation and Xanda Schofield's visualizer
Denny, Matt. SpeedReader. In R.
Hessel, Jack (who took this class!). FightingWords. In Python.
Lim, Kenneth (who took this class!). fightin-words. Compliant with scikit-learn and distributed via PyPI; borrows (with acknowledgment) from Jack's version.
Marzagão, Thiago. mcq.py. "Because this script processes one file at a time, it can handle corpora that are too large to fit in memory".
Silge, Julia, Alex Hayes, Tyler Schnoebelen. tidylo: Weighted Tidy Log Odds Ratio. In R.
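
For orientation before opening any of the implementations above, here is a minimal pure-Python sketch of the z-scored log-odds ratio with a Dirichlet prior from Monroe et al. (2008). It is a simplification, not a substitute for the listed packages: the function name `fightin_words_z`, the `prior_strength` parameter, and the toy sentences are all hypothetical, and it assumes a uniform prior (the same pseudo-count for every word) rather than an informative prior estimated from a background corpus.

```python
import math
from collections import Counter


def fightin_words_z(counts_a, counts_b, prior_strength=0.01):
    """Return {word: z-score} comparing two word-count samples.

    Positive scores mark words associated with sample A, negative
    scores words associated with sample B.
    Assumption of this sketch: a uniform Dirichlet prior that gives
    every vocabulary word the same pseudo-count `prior_strength`.
    The combined vocabulary should contain at least two word types.
    """
    vocab = set(counts_a) | set(counts_b)
    alpha_0 = prior_strength * len(vocab)  # total prior mass
    n_a = sum(counts_a.values())
    n_b = sum(counts_b.values())
    z_scores = {}
    for word in vocab:
        # Smoothed counts: observed count plus prior pseudo-count.
        y_a = counts_a.get(word, 0) + prior_strength
        y_b = counts_b.get(word, 0) + prior_strength
        # Log-odds of the word within each sample.
        log_odds_a = math.log(y_a / (n_a + alpha_0 - y_a))
        log_odds_b = math.log(y_b / (n_b + alpha_0 - y_b))
        delta = log_odds_a - log_odds_b
        # Approximate variance of the log-odds difference.
        variance = 1.0 / y_a + 1.0 / y_b
        z_scores[word] = delta / math.sqrt(variance)
    return z_scores


# Toy samples (invented for illustration only).
sample_a = Counter("the tax cut helps the economy tax relief".split())
sample_b = Counter("the health care plan helps the families health".split())
scores = fightin_words_z(sample_a, sample_b)

# List words from most A-associated to most B-associated.
for word, z in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(f"{word:10s} {z:+.2f}")
```

Words used equally often in both samples (here "the" and "helps") score zero, while words unique to one sample move toward the extremes; dividing by the estimated standard deviation is what keeps rare words from dominating the ranking.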

#4 Feb 1: Distances between language sources

plot of the behavior of different distributional difference functions

Lecture

Handout, recording (only available to enrolled students)

Lecture references and further reading

#5 Feb 6: "No country for old members"

Jack Ziegler (license purchased)

Lecture

slides, recording (only available to enrolled students)

Lecture references and further reading

Danescu-Niculescu-Mizil, Cristian, Robert West, Dan Jurafsky, Jure Leskovec, and Christopher Potts. 2013. No country for old members: User lifecycle and linguistic change in online communities. WWW, pp. 307--318. Best paper award. [ACM link] [paper "homepage"]
Hamilton, William, Justine Zhang, Cristian Danescu-Niculescu-Mizil, Dan Jurafsky, and Jure Leskovec. 2017. Loyalty in Online Communities. ICWSM: 540–43.
Lucy, Li, and David Bamman. 2021. Characterizing English Variation across Social Media Communities with BERT. Transactions of the Association for Computational Linguistics 9 (May): 538–56. (It is the author's choice to be alphabetized by "Lucy".)
Tan, Chenhao, and Lillian Lee. 2015. All Who Wander: On the Prevalence and Characteristics of Multi-Community Engagement. WWW 1056–66. [paper homepage]
Tran, Trang, and Mari Ostendorf. 2016. Characterizing the Language of Online Communities and Its Relation to Community Reception. EMNLP, 1030–35.
Zhang, Justine, William Hamilton, Cristian Danescu-Niculescu-Mizil, Dan Jurafsky, and Jure Leskovec. 2017. Community Identity and User Engagement in a Multi-Community Landscape. ICWSM: 377–86. [arxiv version has some changes]

#6 Feb 8: Breezy intro to semantic shift

Assignments/announcements

Assignment 2: presentation/annotation of semantic shift papers: schedule and instructions posted.

derivations of `trump' over time in r/politics

Hofmann, Pierrehumbert, and Schuetze

Lecture

slides, recording (only available to enrolled students)

Lecture references and further reading

Boholm, Max, and Asad Sayeed. 2023. Political Dogwhistles and Community Divergence in Semantic Change. 4th Workshop on Computational Approaches to Historical Language Change
Garley, Matt, and Julia Hockenmaier. 2012. Beefmoves: Dissemination, Diversity, and Dynamics of English Borrowings in a German Hip Hop Forum. ACL (Volume 2: Short Papers), 135–39.
Hengchen, Simon, Nina Tahmasebi, Dominik Schlechtweg, and Haim Dubossarsky. 2021. Challenges for Computational Lexical Semantic Change. In Computational Approaches to Semantic Change. Zenodo version.
Hofmann, Valentin, Janet Pierrehumbert, and Hinrich Schütze. 2020. Predicting the Growth of Morphological Families from Social and Linguistic Factors. In ACL, pages 7273–7283.
Kutuzov, Andrey, Lilja Øvrelid, Terrence Szymanski, and Erik Velldal. 2018. Diachronic Word Embeddings and Semantic Shifts: A Survey. International Conference on Computational Linguistics.
Montanelli, Stefano, and Francesco Periti. 2023. A Survey on Contextualised Semantic Shift Detection. arXiv.
Pierrehumbert, Janet. 2012. The Dynamic Lexicon. In Handbook of Laboratory Phonology.
Tahmasebi, Nina, Lars Borin, and Adam Jatowt. 2021. Survey of Computational Approaches to Lexical Semantic Change Detection. In Computational Approaches to Semantic Change. Updated version of 2018 arxiv paper.

#7 Feb 13: Semantic shift II

Assignments/announcements

Assignment 3 "Fightin' words" announced: Ed post due and presentations on Th Feb 22. Details in slides.

Literally: You Keep Using That Word, I Do Not Think It Means What You Think It Means

KnowYourMeme

Lecture

slides, recording (available only to enrolled students)

Lecture references and further reading

Alammar, Jay. 2019. The Illustrated Word2vec. An explanation of SGNS (skip-gram with negative sampling).
Antoniak, Maria and David Mimno. 2018. Evaluating the Stability of Embedding-based Word Similarities. Transactions of the Association for Computational Linguistics (TACL), 6:107–119.
Del Tredici, Marco, Raquel Fernández, and Gemma Boleda. 2019. Short-Term Meaning Shift: A Distributional Exploration. NAACL, Volume 1 (Long and Short Papers). https://doi.org/10.18653/v1/N19-1210.
Eliav, Ron, Anya Ji, Yoav Artzi, Robert D. Hawkins. 2023. Semantic uncertainty guides the extension of conventions to new referents. Annual Conference of the Cognitive Science Society (CogSci).
Fitch, W. Tecumseh. 2007. An Invisible Hand. Nature 7163:665--667. https://doi.org/10.1038/449665a.
Hamilton, William L., Jure Leskovec, and Dan Jurafsky. 2016. Diachronic Word Embeddings Reveal Statistical Laws of Semantic Change. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). https://doi.org/10.18653/v1/P16-1141.
Wendlandt, Laura, Jonathan K. Kummerfeld, and Rada Mihalcea. 2018. Factors Influencing the Surprising Instability of Word Embeddings. NAACL (Volume 1: Long Papers), 2092--2102.
5th workshop on Computational Approaches to Historical Language Change, 2024 and associated AXOLOTL-24 Shared Task on Explainable Semantic Change Modeling.

#8 Feb 15: Semantic shift: presentations by PH and BW.

xkcd

Lecture

Very recent New York Times article (Feb 10) about a semantic shift: The First Meaning of ‘Crush’ Came Long Before a ‘First Crush’.
Slides on Noble et al., by PH
Slides on Rosenfeld and Erk, by BW
recording (available only to enrolled students)

Lecture references and further reading

Noble, Bill, Asad Sayeed, Raquel Fernández, and Staffan Larsson. 2021. Semantic Shift in Social Networks. *SEM 2021: The Tenth Joint Conference on Lexical and Computational Semantics, 26–37.
Rosenfeld, Alex, and Katrin Erk. 2018. Deep Neural Models of Semantic Shift. NAACL, Volume 1 (Long Papers), 474–84.

#9 Feb 20: Semantic shift: Presentations by DK, HK, and TW

Lecture

Slides on Rudolph and Blei 2018 by HK
Slides on Liu et al. 2021 by DK
Slides on Card 2023 by TW
recording (only accessible to enrolled students)

Lecture references and further reading

Card, Dallas. 2023. Substitution-Based Semantic Change Detection Using Contextual Embeddings. ACL (Volume 2: Short Papers), 590–602.
Liu, Yang, Alan Medlar, and Dorota Glowacka. 2021. Statistically Significant Detection of Semantic Shifts Using Contextual Word Embeddings. 2nd Workshop on Evaluation and Comparison of NLP Systems, 104–13.
Rudolph, Maja, and David Blei. 2018. Dynamic Embeddings for Language Evolution. World Wide Web Conference, 1003–11.

#10 Feb 22: Fightin' words presentations

Lecture

Slides are on Ed Discussions. Recording (only accessible to enrolled students)

#11 Feb 27: No class: Feb break. Keeping the lecture number so that even lecture numbers remain Thursdays.

#12 Feb 29: Conversation I

Assignments/announcements

SEE SLIDES: A4, A5 (list of potential paper selections), midterm paper

talking about the purpose of conversations

Cat and Girl

Lecture

Slides, recording (available only to enrolled students)

Lecture references and further reading

Dingemanse, Mark and Andreas Liesenfeld. 2022. From text to talk: Harnessing conversational corpora for humane and diversity-aware language technology. ACL (Volume 1: Long Papers), pages 5614–5633.
Yeomans, Michael, F. Katelynn Boland, Hanne K. Collins, Nicole Abi-Esber, and Alison Wood Brooks. 2023. A Practical Guide to Conversation Research: How to Study What People Say to Each Other. Advances in Methods and Practices in Psychological Science.

#13 Mar 5: Conversation II: the Grosz and Sidner '86 theory of discourse

Garry Kasparov, Maurice Ashley, Yasser Seirawan and a bunch of soft drinks during the period of the 1996 match against Deep Blue. Photo by Kenneth Thompson, provided at computerhistory.org

Lecture

Slides, handout/A4 data, recording (the latter available only to enrolled students)

Lecture references and further reading

Grosz, Barbara J., and Sidner, Candace L. 1986. Attention, intentions, and the structure of discourse. Computational Linguistics 12(3): 175-204.
Summary of and source for transcripts of live commentary on the first Kasparov/Deep Blue match

#14 Mar 7: Conversation III: Conversational trajectories

illustrated passage about a conversation going wrong

Allie Brosh

Lecture

Slides, recording (only available to enrolled students)

Lecture references and further reading

Bao, Jiajun, Junjie Wu, Yiming Zhang, Eshwar Chandrasekharan, and David Jurgens. 2021. Conversations Gone Alright: Quantifying and Predicting Prosocial Outcomes in Online Conversations. In Proceedings of the Web Conference 2021 (WWW '21). Association for Computing Machinery, New York, NY, USA, 1134–1145. https://doi.org/10.1145/3442381.3450122
Niculae, Vlad and Cristian Danescu-Niculescu-Mizil. 2016. Conversational Markers of Constructive Discussions. NAACL, 568–578.
Zhang, Justine, Jonathan Chang, Cristian Danescu-Niculescu-Mizil, Lucas Dixon, Yiqing Hua, Dario Taraborelli, and Nithum Thain. 2018. Conversations Gone Awry: Detecting Early Signs of Conversational Failure. ACL (Volume 1: Long Papers), 1350–61. https://doi.org/10.18653/v1/P18-1125.

#15 Mar 12: Presentations by KL and AM

Wondermark comic on the terrible sea lion

David Malki

Lecture

Slides on Mirzaei, Meshgi, and Sekine by AM
Slides on Yi and Zubiaga by KL
recording (available only to enrolled students)

Lecture references and further reading

Mirzaei, Maryam Sadat, Kourosh Meshgi, and Satoshi Sekine. 2023. What Is the Real Intention behind This Question? Dataset Collection and Intention Classification. ACL (Volume 1: Long Papers). https://doi.org/10.18653/v1/2023.acl-long.761.
Yi, Peiling, and Arkaitz Zubiaga. 2023. Learning like Human Annotators: Cyberbullying Detection in Lengthy Social Media Sessions. WWW. https://doi.org/10.1145/3543507.3583873.

#16 Mar 14: Presentations by FH, MM, YW

Assignments/announcements

Full midterm-paper instructions released
Policies on academic integrity, use of AI generation/editing systems posted on course webpage

Lecture

Slides on Fu, Chang, and Danescu-Niculescu-Mizil by MM
Slides on Ge, Cheng, and Liu by FH
Slides on Altarawneh, Agrawal, Jenkin and Papagelis by YW
recording (accessible only to enrolled students)

Lecture references and further reading

Altarawneh, Enas, Ameeta Agrawal, Michael Jenkin, and Manos Papagelis. 2023. Conversation Derailment Forecasting with Graph Convolutional Networks. 7th Workshop on Online Abuse and Harms (WOAH) 160–69. DOI: 10.18653/v1/2023.woah-1.16
Fu, Liye, Jonathan P. Chang, and Cristian Danescu-Niculescu-Mizil. 2019. Asking the Right Question: Inferring Advice-Seeking Intentions from Personal Narratives. NAACL Volume 1 (Long and Short Papers). https://doi.org/10.18653/v1/N19-1052.
Ge, Suyu, Lu Cheng, and Huan Liu. 2021. Improving Cyberbullying Detection with User Interaction. WWW, 496–506.

#17 Mar 19: Presentations by AB and EF

Lecture

Slides on Habernal, Wachsmuth, Gurevych, and Stein by EF
Slides on Yuan and Singh by AB
recording (accessible only to enrolled students)

Lecture references and further reading

Habernal, Ivan, Henning Wachsmuth, Iryna Gurevych, and Benno Stein. 2018. Before Name-Calling: Dynamics and Triggers of Ad Hominem Fallacies in Web Argumentation. NAACL, Volume 1 (Long Papers), 386–96. doi:10.18653/v1/N18-1036.
Yuan, Jiaqing, and Munindar P. Singh. 2023. Conversation Modeling to Predict Derailment. International AAAI Conference on Web and Social Media (ICWSM) 17: 926–35. doi:10.1609/icwsm.v17i1.22200. Observation: there are some differences of note between the proceedings version and the arxiv version; the latter makes it clear that the datasets used were not created by the authors.

#18 Mar 21: Reflections on intention-recognition and conversation-trajectories papers presented

joke where the intention was actually to find out if someone had the time

David Malki!

Lecture

slides, recording (available only to enrolled students)

Lecture references and further reading

The main papers were the seven posted in the previous three lectures.
Chang, Jonathan P., Justin Cheng, and Cristian Danescu-Niculescu-Mizil. 2020. Don’t Let Me Be Misunderstood: Comparing Intentions and Perceptions in Online Discussions. WWW, 2066–2077. https://doi.org/10.1145/3366423.3380273
Chang, Jonathan P., Charlotte Schluger, and Cristian Danescu-Niculescu-Mizil. 2022. Thread With Caution: Proactively Helping Users Assess and Deescalate Tension in Their Online Discussions. CSCW. https://doi.org/10.1145/3555603
Choi, Frederick, Tanvi Bajpai, Sowmya Pratipati, and Eshwar Chandrasekharan. 2023. ConvEx: A Visual Conversation Exploration System for Discord Moderators. CSCW, https://doi.org/10.1145/3610053
Ferracane, Elisa, Greg Durrett, Junyi Jessy Li, and Katrin Erk. 2021. Did they answer? Subjective acts and intents in conversational discourse. NAACL, 1626–1644.
Ranganath, Rajesh, Dan Jurafsky, and Dan McFarland. 2009. It’s Not You, It’s Me: Detecting Flirting and its Misperception in Speed-Dates. EMNLP, 334-342.

#19 Mar 26: Midterm consultations

Assignments/announcements

~~Midterm paper due, 11:59pm~~ Date moved

#20 Mar 28: Midterm consultations

Fri Mar 29: midterm paper due 11:59pm on CMSX. [instructions]

Apr 2: No class — Spring break

Apr 4: No class — Spring break

#21 Apr 9: (Cancelled: out sick)

#22 Apr 11: Community-specific controversy prediction with early comment trees

Lecture

Slides, recording (accessible only to enrolled students)

Lecture references and further reading

Hessel, Jack and Lillian Lee. 2019. Something’s Brewing! Early Prediction of Controversy-causing Posts from Discussion Features. NAACL, pages 1648–1659.
Salganik, Matthew J., Peter Sheridan Dodds, and Duncan J. Watts. 2006. Experimental Study of Inequality and Unpredictability in an Artificial Cultural Market. Science 311:854-856. DOI:10.1126/science.1121066

#23 Apr 16: NLP and causal inference: an example paper

Assignments/announcements

List of paper choices, schedule, and instructions for A6, your last presentation/annotation

Lecture

Slides, recording (only accessible to enrolled students)

Lecture references and further reading

Bryan, Christopher J., Gregory M. Walton, Todd Rogers, and Carol S. Dweck. 2011. Motivating Voter Turnout by Invoking the Self. Proceedings of the National Academy of Sciences 108 (31): 12653–56. https://doi.org/10.1073/pnas.1103343108.

Followup: failure to replicate: Gerber, Alan S., Gregory A. Huber, Daniel R. Biggers, and David J. Hendry, June 28, 2016. A field experiment shows that subtle linguistic cues might not affect voter behavior. Proceedings of the National Academy of Sciences 113(26): 7112-7117.
Response to followup: "What is an authentic replication attempt and what is not? Gerber et al.’s paper ... gives us the opportunity to reflect on this issue of longstanding concern to us." Bryan, Christopher J., Gregory M. Walton, and Carol S. Dweck, Oct 18, 2016. Psychologically authentic versus inauthentic replication attempts. Proceedings of the National Academy of Sciences 113(43): E6548.
Response: "Although we find Bryan et al.’s ... explanation unconvincing, this exchange is well-timed. The original findings have (to our knowledge) never been successfully replicated, and this November provides ample opportunity to test noun vs. verb in the political environment Bryan et al. ... suggest is ideal for producing 11–14 percentage-point effects." Gerber, Alan S., Gregory A. Huber, Daniel R. Biggers, and David J. Hendry, Oct 25, 2016. Reply to Bryan et al.: Variation in context unlikely explanation of nonrobustness of noun versus verb results. Proceedings of the National Academy of Sciences 113(43): E6549--E6550.
Gerber, Alan, Gregory Huber, Albert Fang, 2018. Do Subtle Linguistic Interventions Priming a Social Identity as a Voter Have Outsized Effects on Voter Turnout? Evidence From a New Replication Experiment Political Psychology 39: 925--938.
Bryan, Christopher J., David S. Yeager, and Joseph M. O’Brien, 2019. Replicator degrees of freedom allow publication of misleading failures to replicate. Proceedings of the National Academy of Sciences 116 (51) 25535--25545.
Gerber, Alan S., Gregory A. Huber, Albert H. Fang, 2020. Voting behavior is unaffected by subtle linguistic cues: Evidence from a psychologically authentic replication. Behavioural Public Policy, 1--15.
Green, Donald P., and José S. Gomez. 2022. Psychological Theories Meet the Challenge of Persuading and Mobilising Voters. In The Cambridge Handbook of Political Psychology, edited by Danny Osborne and Chris G. Sibley, 476–91. Cambridge Handbooks in Psychology. Cambridge: Cambridge University Press.

The Grammar of Persuasion: A Meta-Analytic Review Disconfirming the Role of Nouns as Linguistic Cues of Subsequent Behavior. Journal of Language and Social Psychology.

Pryzant, Reid, Dallas Card, Dan Jurafsky, Victor Veitch, and Dhanya Sridhar. 2021. Causal Effects of Linguistic Properties. NAACL, 4095–4109.

#24 Apr 18: Polarization presentations (A6 part 1)

Lecture

Slides on Bianchi, Marelli, Nicoli, and Palmonari by AB
Slides on Efstratiou by TW
Recording (only available to enrolled students)

Lecture references and further reading

Bianchi, Federico, Marco Marelli, Paolo Nicoli, and Matteo Palmonari. 2021. SWEAT: Scoring Polarization of Topics across Different Corpora. EMNLP, 10065–72. https://doi.org/10.18653/v1/2021.emnlp-main.788.
Efstratiou, Alexandros. 2024. Deliberate Exposure to Opposing Views and Its Association with Behavior and Rewards on Political Communities. The Web Conference. http://arxiv.org/abs/2401.14608.
Jensen, Jacob, Suresh Naidu, Ethan Kaplan, Laurence Wilse-Samson. 2012. Political Polarization and the Dynamics of Political Language: Evidence from 130 Years of Partisan Speech [with Comments and Discussion]. Brookings Papers on Economic Activity, 1–81. http://www.jstor.org/stable/41825364.
Németh, Renáta. 2023. A scoping review on the use of natural language processing in research on political polarization: Trends and research prospects. Journal of Computational Social Science 6, 289–313 (2023). https://doi.org/10.1007/s42001-022-00196-2

#25 Apr 23: Paper presentations (A6 part 2)

Lecture

Slides by HK on Bao et al.
Slides by MM on Tierney et al.
recording (only available to enrolled students)

Lecture references and further reading

Bao, Jiajun, Junjie Wu, Yiming Zhang, Eshwar Chandrasekharan, and David Jurgens. 2021. Conversations Gone Alright: Quantifying and Predicting Prosocial Outcomes in Online Conversations. In Proceedings of the Web Conference 2021 (WWW '21). Association for Computing Machinery, New York, NY, USA, 1134–1145. https://doi.org/10.1145/3442381.3450122
Tierney, Graham and Alexander Volfovsky. 2021. Sensitivity Analysis for Causal Mediation through Text: an Application to Political Polarization. First Workshop on Causal Inference and NLP, 61–73.

#26 Apr 25: Paper presentations (A6 part 3)

Lecture

Slides by KL on Jo et al.
Slides by DK on Imran, Chatterjee, and Damevski
Recording (available only to enrolled students)

Lecture references and further reading

Jo, Yohan, Shivani Poddar, Byungsoo Jeon, Qinlan Shen, Carolyn Rose, and Graham Neubig. 2018. Attentive Interaction Model: Modeling Changes in View in Argumentation. NAACL, 103–16.
Imran, Mia Mohammad, Preetha Chatterjee, and Kostadin Damevski. 2024. Uncovering the Causes of Emotions in Software Developer Communication Using Zero-Shot LLMs. IEEE/ACM 46th International Conference on Software Engineering, 1–13.

#27 Apr 30: Paper presentations (A6 part 4)

Lecture

Slides on De Kock, Stafford, and Vlachos by YW
Slides on Stewart and Mihalcea by EF
recording (only accessible to enrolled students)

Lecture references and further reading

De Kock, Christine, Tom Stafford, and Andreas Vlachos. 2022. How to Disagree Well: Investigating the Dispute Tactics Used on Wikipedia. In Proceedings of the 2022 EMNLP, 3824–37.
Stewart, Ian and Rada Mihalcea. 2022. How Well Do You Know Your Audience? Toward Socially-aware Question Generation. SIGDial, pages 255–269

#28 May 2: Paper presentations (A6 part 5)

Lecture

Slides on Agarwal et al. by FH
Slides on Khare et al. by PH
recording (accessible only to enrolled students)

Lecture references and further reading

Agarwal, Vibhor, Sagar Prakash Joglekar, Anthony P. Young, and Nishanth R. Sastry. 2022. GraphNLI: A Graph-based Natural Language Inference Model for Polarity Prediction in Online Debates. The Web Conference, 2729–2737. https://doi.org/10.1145/3485447.3512144
Khare, Prashant, Ravi Shekhar, Mladen Karan, Stephen McQuistin, Colin Perkins, Ignacio Castro, Gareth Tyson, Patrick Healey, and Matthew Purver. 2023. Tracing Linguistic Markers of Influence in a Large Online Organisation. ACL (Volume 2: Short Papers), pages 82–90.

#29 May 7: Paper presentations (A6 part 6)

Assignments/announcements

Instructions for the final paper released

Lecture

Slides by BW on Demszky et al.
Slides by AM on Ding, Horning and Rho
recording (only accessible to enrolled students)

Lecture references and further reading

Demszky, Dorottya, Jing Liu, Zid Mancenido, Julie Cohen, Heather Hill, Dan Jurafsky, and Tatsunori Hashimoto. 2021. Measuring Conversational Uptake: A Case Study on Student-Teacher Interactions. ACL/IJCNLP (Volume 1: Long Papers), pages 1638–1653.
Ding, Xiaohan, Horning, Michael, and Rho, Eugenia H. 2023. Same Words, Different Meanings: Semantic Polarization in Broadcast Media Language Forecasts Polarity in Online Public Discourse. ICWSM, 17(1), 161-172.
Leskovec, Jure, Lars Backstrom, and Jon Kleinberg. 2009. Meme-Tracking and the Dynamics of the News Cycle. KDD, 497--506. https://doi.org/10.1145/1557019.1557077. [website]

May 16 (Th), 4:30pm, as determined by the registrar: Final paper due. [instructions]

Code for generating the calendar formatting adapted from Andrew Myers. Portions of the content of this website and course were created by collaboration between Cristian Danescu-Niculescu-Mizil and Lillian Lee over multiple runnings of this course.