Knuples, U., Falenska, A., & Miletić, F. (2024). Gender Identity in Pretrained Language Models: An Inclusive Approach to Data Creation and Probing. In Y. Al-Onaizan, M. Bansal, & Y.-N. Chen (Eds.),
Findings of the Association for Computational Linguistics: EMNLP 2024 (pp. 11612--11631). Association for Computational Linguistics.
https://aclanthology.org/2024.findings-emnlp.680
Abstract
Pretrained language models (PLMs) have been shown to encode binary gender information of text authors, raising the risk of skewed representations and downstream harms. This effect is yet to be examined for transgender and non-binary identities, whose frequent marginalization may exacerbate harmful system behaviors. Addressing this gap, we first create TRANsCRIPT, a corpus of YouTube transcripts from transgender, cisgender, and non-binary speakers. Using this dataset, we probe various PLMs to assess if they encode the gender identity information, examining both frozen and fine-tuned representations as well as representations for inputs with author-specific words removed. Our findings reveal that PLM representations encode information for all gender identities but to different extents. The divergence is most pronounced for cis women and non-binary individuals, underscoring the critical need for gender-inclusive approaches to NLP systems.BibTeX
Dönmez, E., Vu, T., & Falenska, A. (2024). Please note that I’m just an AI: Analysis of Behavior Patterns of LLMs in (Non-)offensive Speech Identification. In Y. Al-Onaizan, M. Bansal, & Y.-N. Chen (Eds.),
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (pp. 18340--18357). Association for Computational Linguistics.
https://aclanthology.org/2024.emnlp-main.1019
Abstract
Offensive speech is highly prevalent on online platforms. Being trained on online data, Large Language Models (LLMs) display undesirable behaviors, such as generating harmful text or failing to recognize it. Despite these shortcomings, the models are becoming a part of our everyday lives by being used as tools for information search, content creation, writing assistance, and many more. Furthermore, the research explores using LLMs in applications with immense social risk, such as late-life companions and online content moderators. Despite the potential harms from LLMs in such applications, whether LLMs can reliably identify offensive speech and how they behave when they fail are open questions. This work addresses these questions by probing sixteen widely used LLMs and showing that most fail to identify (non-)offensive online language. Our experiments reveal undesirable behavior patterns in the context of offensive speech detection, such as erroneous response generation, over-reliance on profanity, and failure to recognize stereotypes. Our work highlights the need for extensive documentation of model reliability, particularly in terms of the ability to detect offensive language.BibTeX
Sindermann, C. (2024). Relations between different components of group identification and types of social media political participation in the context of the Fridays for Future movement.
Personality and Individual Differences,
230, 112773.
https://doi.org/10.1016/j.paid.2024.112773
Abstract
Social media revolutionized political participation possibilities, and movements like Fridays for Future (FFF) rely on them. Understanding the factors driving the novel online modes of political participation is crucial. Consequently, this project explores the relations between diverse group identification components and social media political participation types in the context of FFF, drawing on the Social Identity Model of Pro-Environmental Action and extending previous work by treating group identification as multidimensional construct. Two German quota samples (N = 619, 45 \% men; N = 616, 44 \% men) completed online surveys assessing group identification components and social media political participation types applied to the FFF context. Generalized linear models were used to investigate their relations. Centrality – the extent to which individuals perceive their membership in FFF as an important aspect of their self-concept – was the strongest and most consistent predictor of Counter, Follower, and Expressive Engagement via social media in both samples. Additionally, each social media political participation type exhibited distinct connections with different group identification components. These results underscore the relevance of group identification in emerging forms of social media political participation. The diverse connections between distinct group identification components and participation types underscore the need to examine both considering their diverse forms.BibTeX
Erhard, L., Hanke, S., Remer, U., Falenska, A., & Heiberger, R. H. (2024). PopBERT. Detecting Populism and Its Host Ideologies in the German Bundestag.
Political Analysis.
https://doi.org/10.1017/pan.2024.12
BibTeX
Hillebrand, M. C., Sindermann, C., Montag, C., Wuttke, A., Heinzelmann, R., Haas, H., & Wilz, G. (2024). Salivary cortisol and alpha-amylase as stress markers to evaluate an individualized music intervention for people with dementia: feasibility and pilot analyses.
BMC Research Notes,
17(1), Article 1.
https://doi.org/10.1186/s13104-024-06904-7
Abstract
We investigated salivary biomarkers of stress, more specifically, cortisol and alpha-amylase, to evaluate effects of individualized music listening (IML) in people with dementia.BibTeX
Brandenstein, N., Montag, C., & Sindermann, C. (2024). To Follow or Not to Follow: Estimating Political Opinion From Twitter Data Using a Network-Based Machine Learning Approach.
Social Science Computer Review.
https://doi.org/10.1177/08944393241279418
BibTeX
Kaiser, J., & Falenska, A. (2024). How to Translate SQuAD to German? A Comparative Study of Answer Span Retrieval Methods for Question Answering Dataset Creation. In P. H. Luz de Araujo, A. Baumann, D. Gromann, B. Krenn, B. Roth, & M. Wiegand (Eds.),
Proceedings of the 20th Conference on Natural Language Processing (KONVENS 2024) (pp. 134--140). Association for Computational Linguistics.
https://aclanthology.org/2024.konvens-main.15
BibTeX
Chen, H., Roth, M., & Falenska, A. (2024). What Can Go Wrong in Authorship Profiling: Cross-Domain Analysis of Gender and Age Prediction. In A. Faleńska, C. Basta, M. Costa jussà, S. Goldfarb-Tarrant, & D. Nozza (Eds.),
Proceedings of the 5th Workshop on Gender Bias in Natural Language Processing (GeBNLP) (pp. 150--166). Association for Computational Linguistics.
https://aclanthology.org/2024.gebnlp-1.9
Abstract
Authorship Profiling (AP) aims to predict the demographic attributes (such as gender and age) of authors based on their writing styles. Ever-improving models mean that this task is gaining interest and application possibilities. However, with greater use also comes the risk that authors are misclassified more frequently, and it remains unclear to what extent the better models can capture the bias and who is affected by the models' mistakes. In this paper, we investigate three established datasets for AP as well as classical and neural classifiers for this task. Our analyses show that it is often possible to predict the demographic information of the authors based on textual features. However, some features learned by the models are specific to datasets. Moreover, models are prone to errors based on stereotypes associated with topical bias.BibTeX
Faleńska, A., Basta, C., Costa jussà, M., Goldfarb-Tarrant, S., & Nozza, D. (Eds.). (2024).
Proceedings of the 5th Workshop on Gender Bias in Natural Language Processing (GeBNLP). Association for Computational Linguistics.
https://aclanthology.org/2024.gebnlp-1.0
BibTeX
Costa jussà, M., Andrews, P., Basta, C., Ciro, J., Falenska, A., Goldfarb-Tarrant, S., Mosquera, R., Nozza, D., & Sánchez, E. (2024). Overview of the Shared Task on Machine Translation Gender Bias Evaluation with Multilingual Holistic Bias. In A. Faleńska, C. Basta, M. Costa jussà, S. Goldfarb-Tarrant, & D. Nozza (Eds.),
Proceedings of the 5th Workshop on Gender Bias in Natural Language Processing (GeBNLP) (pp. 399--404). Association for Computational Linguistics.
https://aclanthology.org/2024.gebnlp-1.26
Abstract
We describe the details of the Shared Task of the 5th ACL Workshop on Gender Bias in Natural Language Processing (GeBNLP 2024). The task uses dataset to investigate the quality of Machine Translation systems on a particular case of gender robustness. We report baseline results as well as the results of the first participants. The shared task will be permanently available in the Dynabench platform.BibTeX
Go, P., & Falenska, A. (2024). Is there Gender Bias in Dependency Parsing? Revisiting ``Women’s Syntactic Resilience’’. In A. Faleńska, C. Basta, M. Costa jussà, S. Goldfarb-Tarrant, & D. Nozza (Eds.),
Proceedings of the 5th Workshop on Gender Bias in Natural Language Processing (GeBNLP) (pp. 269--279). Association for Computational Linguistics.
https://aclanthology.org/2024.gebnlp-1.17
Abstract
In this paper, we revisit the seminal work of Garimella et al. 2019, who reported that dependency parsers learn demographically-related signals from their training data and perform differently on sentences authored by people of different genders. We re-run all the parsing experiments from Garimella et al. 2019 and find that their results are not reproducible. Additionally, the original patterns suggesting the presence of gender biases fail to generalize to other treebank and parsing architecture. Instead, our data analysis uncovers methodological shortcomings in the initial study that artificially introduced differences into female and male datasets during preprocessing. These disparities potentially compromised the validity of the original conclusions.BibTeX
Babiker, A., Alshakhsi, S., Sindermann, C., Montag, C., & Ali, R. (2024). Examining the growth in willingness to pay for digital wellbeing services on social media: A comparative analysis.
Heliyon,
10(11), Article 11.
https://doi.org/10.1016/j.heliyon.2024.e32467
BibTeX
Falenska, A., Vecchi, E. M., & Lapesa, G. (2024). Self-reported Demographics and Discourse Dynamics in a Persuasive Online Forum. In N. Calzolari, M.-Y. Kan, V. Hoste, A. Lenci, S. Sakti, & N. Xue (Eds.),
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) (pp. 14606--14621). ELRA and ICCL.
https://aclanthology.org/2024.lrec-main.1272
Abstract
Research on language as interactive discourse underscores the deliberate use of demographic parameters such as gender, ethnicity, and class to shape social identities. For example, by explicitly disclosing one's information and enforcing one's social identity to an online community, the reception by and interaction with the said community is impacted, e.g., strengthening one's opinions by depicting the speaker as credible through their experience in the subject. Here, we present a first thorough study of the role and effects of self-disclosures on online discourse dynamics, focusing on a pervasive type of self-disclosure: author gender. Concretely, we investigate the contexts and properties of gender self-disclosures and their impact on interaction dynamics in an online persuasive forum, ChangeMyView. Our contribution is twofold. At the level of the target phenomenon, we fill a research gap in the understanding of the impact of these self-disclosures on the discourse by bringing together features related to forum activity (votes, number of comments), linguistic/stylistic features from the literature, and discourse topics. At the level of the contributed resource, we enrich and release a comprehensive dataset that will provide a further impulse for research on the interplay between gender disclosures, community interaction, and persuasion in online discourse.BibTeX
Sindermann, C., Löchner, N., Heinzelmann, R., Montag, C., & Scholz, R. W. (2024). The Revenue Model of Mainstream Online Social Networks and Potential Alternatives: A Scenario-Based Evaluation by German Adolescents and Adults.
Technology in Society, 102569.
https://doi.org/10.1016/j.techsoc.2024.102569
BibTeX
BibTeX
Scholz, R. W., Köckler, H., Zscheischler, J., Czichos, R., Hofmann, K.-M., & Sindermann, C. (2024). Transdisciplinary knowledge integration PART II: Experiences of five transdisciplinary processes on digital data use in Germany.
Technological Forecasting and Social Change,
199, 122981.
https://doi.org/10.1016/j.techfore.2023.122981
Abstract
Transdisciplinary problems are complex, ill-defined, societal real-world problems with high ambiguity that are contested and require multiple trade-offs. Part I of this paper showed that transdisciplinary processes include seven types of knowledge integration: system (ontological), epistemological, cultural, cognitive, social conflict, evolutionary (levels of representation), and complexity-theory-based types of knowledge integration. The epistemological integration of the different modes of reasoning from science and practice is a unique selling point of the transdisciplinary process. Part II presents five transdisciplinary processes for the responsible use of digital data in different vulnerable/sensitive subsystems of Germany (mobility, health, agriculture, SME, and social media). Between 10 and 18 participants (equally representing science and practice in each group) synchronously constructed socially robust orientations as pillars of a white book. We elaborate that outcomes of a transdisciplinary process can be improved, and barriers diminished by reflecting on which of the seven types of knowledge integration are applied (see Part I). This is done for the six phases of a transdisciplinary process: (1) triggering, (2) initiation, (3) preparation, (4) planning, (5) core, and (6) post-processing. We particularly address researchers and practitioners who seek insights into how the production and integration of knowledge can be improved by transdisciplinary processes.BibTeX
Sindermann, C., Montag, C., & Elhai, J. D. (2024). The Degree of Homogeneity Versus Heterogeneity in Individuals’ Political News Consumption - https://econtent.hogrefe.com/doi/abs/10.1027/1864-1105/a000417?journalCode=zmp.
Journal of Media Psychology.
https://doi.org/10.1027/1864-1105/a000417
Abstract
Abstract: This work investigated the prevalence of filter bubble or echo chamber-related phenomena, psychological factors rendering individuals resilient or vulnerable to them, and their associations to political views focusing on extremity and polarization. For this, a cross-cultural replication of a study by Sindermann et al. (2021) was conducted. As an extension, multiple political views variables were assessed to examine whether the application of different conceptualizations of political views explains heterogeneous findings across previous studies. Two samples were recruited: 390 (n = 135 males) US college students and a quota sample of 489 (n = 243 males) US adults. Participants completed personality scales and measures on political news consumption homogeneity versus heterogeneity and political views. Consistent with previous research, results revealed few individuals consume political news absolutely homogeneously. Openness was negatively related to the degree of political news consumption homogeneity, and the relationship between political news consumption homogeneity and political views yielded inconsistent, often statistically nonsignificant, results. These findings challenge the prevailing notion of filter bubbles and echo chambers as widespread phenomena and indicate that relationships between political news consumption homogeneity and political views are not necessarily deleterious with respect to extremization and polarization. As such, the results suggest that these phenomena might not be as significant for the general population as previously thought. Nonetheless, certain individuals might still find themselves in filter bubbles or echo chambers and suffer from accompanying consequences. In this regard, the present work replicates findings underscoring that individuals with lower Openness exhibit greater political news consumption homogeneity.BibTeX
BibTeX
Zermiani, F., Dhar, P., Strohm, F., Baumbach, S., Bulling, A., & Wirzberger, M. (2024). Individual differences in visuo-spatial working memory capacity and prior knowledge during interrupted reading.
Frontiers in Cognition,
3.
https://doi.org/10.3389/fcogn.2024.1434642
Abstract
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
Email
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Interruptions are often pervasive and require attentional shifts from the primary task. Limited data are available on the factors influencing individuals' efficiency in resuming from interruptions during digital reading. The reported investigation -conducted using the InteRead dataset -examined whether individual differences in visuo-spatial working memory capacity (vsWMC) and prior knowledge could influence resumption lag times during interrupted reading. Participants' vsWMC capacity was assessed using the symmetry span (SSPAN) task, while a pre-test questionnaire targeted their background knowledge about the text. While reading an extract from a Sherlock Holmes story, they were interrupted six times and asked to answer an opinion question. Our analyses revealed that the interaction between vsWMC and prior knowledge significantly predicted the time needed to resume reading following an interruption. The results from our analyses are discussed in relation to theoretical frameworks of task resumption and current research in the field.BibTeX
Scholz, R. W., Zscheischler, J., Köckler, H., Czichos, R., Hofmann, K.-M., & Sindermann, C. (2024). Transdisciplinary knowledge integration – PART I: Theoretical foundations and an organizational structure.
Technological Forecasting and Social Change,
202, 123281.
https://doi.org/10.1016/j.techfore.2024.123281
Abstract
Transdisciplinary processes deal with transdisciplinary problems that are (i) complex, (ii) societally relevant, (iii) ill-defined, and (iv) real-world problems which often show a high degree of ambiguity resulting in contested perceptions and evaluations among and between scientists and practitioners. Therefore, they are susceptible to multiple trade-offs. Transdisciplinary processes construct socially robust orientations (SoROs) particularly for sustainable transitioning. The integration of science and practice knowledge on equal footing (1) is considered the core of transdisciplinary processes. Yet other forms of knowledge integration contribute essentially to construct SoROs. Individuals may (2) use different modes of thought; (3) refer to various cultures with diverse value and belief systems; and (4) problems are perceived and prioritized based on roles and interests. Coping with transdisciplinary problems, (5) purposeful differentiation and integration and (6) an integration of evolutionary evolving codes of representing knowledge are necessary. Finally, (7) what systems to integrate requires consensus-building among participating scientists and practitioners. This paper is Part I of a two-part publication. It provides a conceptualization of the different types of knowledge integration. Part II analyzes tasks, challenges, and barriers related to different types of knowledge integration in five transdisciplinary processes which developed SoROs for sensitive subsystems of Germany affected by the irresponsible use of digital data.BibTeX
BibTeX
Vaugrante, L., Niepert, M., & Hagendorff, T. (2024).
A Looming Replication Crisis in Evaluating Behavior in Language Models? Evidence and Solutions.
https://arxiv.org/abs/2409.20303
BibTeX
Meding, K., & Hagendorff, T. (2024). Fairness Hacking: The Malicious Practice of Shrouding Unfairness in Algorithms. Philosophy & Technology, 37(1), Article 1.
BibTeX
Alshakhsi, S., Babiker, A., Sindermann, C., Al-Thani, D., Montag, C., & Ali, R. (2024). Willingness to pay for digital wellbeing features on social network sites: a study with Arab and European samples.
Frontiers in Computer Science,
6, 1387681.
https://doi.org/10.3389/fcomp.2024.1387681
BibTeX
Jalali Farahani, F., Hanke, S., Dima, C., Heiberger, R. H., & Staab, S. (2024). Who is targeted? Detecting social group mentions in online political discussions.
Companion Publication of the 16th ACM Web Science Conference, 24–25.
https://doi.org/10.1145/3630744.3658412
Abstract
Social groups are central to political discussions. However, detecting social groups in text often relies on pre-determined socio-demographic categories or supervised learning methods that require extensive hand-labeled datasets. In this paper, we propose a methodology designed to leverage the potential of Large Language Models (LLMs) for the identification and annotation of social groups in text. The experiments show that open LLMs like Llama-2-70B-Chat and Mixtral-8-7B can reliably be used to annotate social groups in a few-shot scenario without the need for supervised learning. The automatically obtained annotations largely match human annotations on random samples from the Reddit Politosphere, resulting in micro-F1 scores of 0.71 and 0.83, respectively.BibTeX
BibTeX
Abstract
Advanced digitization and related information overload foster the prevalence of disruptive stimuli that constantly challenge people in learning and working contexts. The high variety of potential distractions increasingly reduces attention and subsequently minimizes capacities for productive study and work habits. Addressing this challenge, digital tools can support people with resisting situational temptations and keeping focused on meaningful tasks by incorporating features such as timing, rewards, or feedback. While they hold benefits for users’ focused behavior, distraction management, and motivation, existing research also shows hesitation towards using such tools at all. Taking the recently introduced software focUS as an example, the present research investigates in more detail, which factors can foster or hinder users’ acceptance towards digital self-control support. A sample of 96 adult volunteers completed an online survey based on the Unified Theory of Acceptance and Use of Technology (UTAUT) to identify under what circumstances they would be willing to use the previously introduced software tool. Results indicate expected performance gains to be an important predictor to consider when presenting novel technological assistance to potential target groups.BibTeX
Abstract
Chatbots have become increasingly prevalent in our daily lives. Research shows that, while some users trust such technology even with sensitive information, others refuse to rely on this technology for support. When explaining trust in such technology, existing human-technology trust approaches are overlooking factors like users’ emotional states. This three-part experimental study in a real-world setting examines the influence emotional states have on trusting a faulty chatbot. Participants interacted with a chatbot in part one of the study to schedule an appointment some days later for part two. Upon arrival for part two, participants were informed of a mistake in scheduling their appointment. The experimental manipulation consisted of fault attribution to the chatbot in one group, whereas no attribution to fault in the other group. In part three, participants then chose to either schedule a new appointment using the same chatbot or email. A total of N= 58 participants participated in the first part of the study, while a total of N= 30 participants completed the study. The main finding indicates that a more positive momentary emotional state towards the chatbot was related to higher self-reported trust, even after the chatbot made a mistake. However, trust did not affect trusting behavior afterwards. Considering these findings, we contribute theoretical advances to the existing trust research landscape in a setting relevant for everyday life. We also discuss potential explanations for the resulting pattern of effects and implications on chatbot design for the role of emotional states when trusting chatbots.BibTeX