Elenco (non esaustivo) di pubblicazioni prodotte dai membri del laboratorio
2024
Soprano, Michael; Roitero, Kevin; Barbera, David La; Ceolin, Davide; Spina, Damiano; Demartini, Gianluca; Mizzaro, Stefano
Cognitive Biases in Fact-Checking and Their Countermeasures: A Review Journal Article
In: Information Processing & Management, vol. 61, no. 3, pp. 103672, 2024, ISSN: 0306-4573.
@article{SOPRANO2024103672,
title = {Cognitive Biases in Fact-Checking and Their Countermeasures: A Review},
author = {Soprano, Michael and Roitero, Kevin and La Barbera, David and Ceolin, Davide and Spina, Damiano and Demartini, Gianluca and Mizzaro, Stefano},
url = {https://www.sciencedirect.com/science/article/pii/S0306457324000323},
doi = {10.1016/j.ipm.2024.103672},
issn = {0306-4573},
year = {2024},
date = {2024-02-11},
urldate = {2024-01-01},
journal = {Information Processing \& Management},
volume = {61},
number = {3},
pages = {103672},
abstract = {The increase of the amount of misinformation spread every day online is a huge threat to the society. Organizations and researchers are working to contrast this misinformation plague. In this setting, human assessors are indispensable to correctly identify, assess and/or revise the truthfulness of information items, i.e., to perform the fact-checking activity. Assessors, as humans, are subject to systematic errors that might interfere with their fact-checking activity. Among such errors, cognitive biases are those due to the limits of human cognition. Although biases help to minimize the cost of making mistakes, they skew assessments away from an objective perception of information. Cognitive biases, hence, are particularly frequent and critical, and can cause errors that have a huge potential impact as they propagate not only in the community, but also in the datasets used to train automatic and semi-automatic machine learning models to fight misinformation. In this work, we present a review of the cognitive biases which might occur during the fact-checking process. In more detail, inspired by PRISMA – a methodology used for systematic literature reviews – we manually derive a list of 221 cognitive biases that may affect human assessors. Then, we select the 39 biases that might manifest during the fact-checking process, we group them into categories, and we provide a description. Finally, we present a list of 11 countermeasures that can be adopted by researchers, practitioners, and organizations to limit the effect of the identified cognitive biases on the fact-checking activity.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Baroni, Giulia L.; Rasotto, Laura; Roitero, Kevin; Siraj, Ameer Hamza; Mea, V. Della
Vision Transformers for Breast Cancer Histology Image Classification Proceedings Article
In: Foresti, Gian Luca; Fusiello, Andrea; Hancock, Edwin (Ed.): Image Analysis and Processing - ICIAP 2023 Workshops, pp. 15–26, Springer Nature Switzerland, Cham, 2024, ISBN: 978-3-031-51026-7.
@inproceedings{10.1007/978-3-031-51026-7_2,
title = {Vision Transformers for Breast Cancer Histology Image Classification},
author = {Baroni, Giulia L. and Rasotto, Laura and Roitero, Kevin and Siraj, Ameer Hamza and Della Mea, Vincenzo},
editor = {Foresti, Gian Luca and Fusiello, Andrea and Hancock, Edwin},
doi = {10.1007/978-3-031-51026-7_2},
isbn = {978-3-031-51026-7},
year = {2024},
date = {2024-01-21},
urldate = {2024-01-01},
booktitle = {Image Analysis and Processing - {ICIAP} 2023 Workshops},
pages = {15--26},
publisher = {Springer Nature Switzerland},
address = {Cham},
abstract = {We propose a self-attention Vision Transformer (ViT) model tailored for breast cancer histology image classification. The proposed architecture uses a stack of transformer layers, with each layer consisting of a multi-head self-attention mechanism and a position-wise feed-forward network, and it is trained with different strategies and configurations, including pretraining, resize dimension, data augmentation, patch overlap, and patch size, to investigate their impact on performance on the histology image classification task. Experimental results show that pretraining on ImageNet and using geometric and color data augmentation techniques significantly improve the model's accuracy on the task. Additionally, a patch size of $16 \times 16$ and no patch overlap were found to be optimal for this task. These findings provide valuable insights for the design of future ViT-based models for similar image classification tasks.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Ros, Francesca Da; Gaspero, Luca Di; Roitero, Kevin; Barbera, David La; Mizzaro, Stefano; Mea, Vincenzo Della; Valent, Francesca; Deroma, Laura
Supporting Fair and Efficient Emergency Medical Services in a Large Heterogeneous Region Journal Article
In: Journal of Healthcare Informatics Research, 2024, ISSN: 2509-498X.
@article{DaRos2024,
title = {Supporting Fair and Efficient Emergency Medical Services in a Large Heterogeneous Region},
author = {Da Ros, Francesca and Di Gaspero, Luca and Roitero, Kevin and La Barbera, David and Mizzaro, Stefano and Della Mea, Vincenzo and Valent, Francesca and Deroma, Laura},
url = {https://doi.org/10.1007/s41666-023-00154-1},
doi = {10.1007/s41666-023-00154-1},
issn = {2509-498X},
year = {2024},
date = {2024-01-09},
urldate = {2024-01-09},
journal = {Journal of Healthcare Informatics Research},
abstract = {Emergency Medical Services (EMS) are crucial in delivering timely and effective medical care to patients in need. However, the complex and dynamic nature of operations poses challenges for decision-making processes at strategic, tactical, and operational levels. This paper proposes an action-driven strategy for EMS management, employing a multi-objective optimizer and a simulator to evaluate potential outcomes of decisions. The approach combines historical data with dynamic simulations and multi-objective optimization techniques to inform decision-makers and improve the overall performance of the system. The research focuses on the Friuli Venezia Giulia region in north-eastern Italy. The region encompasses various landscapes and demographic situations that challenge fairness and equity in service access. Similar challenges are faced in other regions with comparable characteristics. The Decision Support System developed in this work accurately models the real-world system and provides valuable feedback and suggestions to EMS professionals, enabling them to make informed decisions and enhance the efficiency and fairness of the system.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
2023
Demartini, Gianluca; Roitero, Kevin; Mizzaro, Stefano
Data Bias Management Journal Article
In: Commun. ACM, vol. 67, no. 1, pp. 28–32, 2023, ISSN: 0001-0782.
@article{10.1145/3611641,
title = {Data Bias Management},
author = {Demartini, Gianluca and Roitero, Kevin and Mizzaro, Stefano},
url = {https://doi.org/10.1145/3611641},
doi = {10.1145/3611641},
issn = {0001-0782},
year = {2023},
date = {2023-12-21},
urldate = {2023-12-01},
journal = {Commun. ACM},
volume = {67},
number = {1},
pages = {28--32},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
abstract = {Envisioning a unique approach toward bias and fairness research.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Soprano, Michael; Roitero, Kevin; Mea, Vincenzo Della; Mizzaro, Stefano
Towards a Conversational-Based Agent for Health Services Proceedings Article
In: Falchi, Fabrizio; Giannotti, Fosca; Monreale, Anna; Boldrini, Chiara; Rinzivillo, Salvatore; Colantonio, Sara (Ed.): Proceedings of the Italia Intelligenza Artificiale - Thematic Workshops co-located with the 3rd CINI National Lab AIIS Conference on Artificial Intelligence, pp. 278–283, CEUR-WS.org, Pisa, Italy, 2023.
@inproceedings{DBLP:conf/italia2023/Soprano23,
title = {Towards a Conversational-Based Agent for Health Services},
author = {Soprano, Michael and Roitero, Kevin and Della Mea, Vincenzo and Mizzaro, Stefano},
editor = {Falchi, Fabrizio and Giannotti, Fosca and Monreale, Anna and Boldrini, Chiara and Rinzivillo, Salvatore and Colantonio, Sara},
url = {https://ceur-ws.org/Vol-3486/96.pdf},
year = {2023},
date = {2023-09-20},
urldate = {2023-01-01},
booktitle = {Proceedings of the Italia Intelligenza Artificiale - Thematic Workshops co-located with the 3rd CINI National Lab AIIS Conference on Artificial Intelligence},
volume = {3486},
pages = {278--283},
publisher = {CEUR-WS.org},
address = {Pisa, Italy},
series = {CEUR Workshop Proceedings},
abstract = {Conversational agents provide new modalities to access and interact with services and applications. Recently, they saw a backfire in their popularity, due to the recent advancements in language models. Such agents have been adopted in various fields such as healthcare and education, yet they received little attention in public administration. We describe as a practical use case a service of the portal that provides citizens of the Italian region of Friuli-Venezia Giulia with services related to their own Electronic Health Records. The service considered allows them to search for the available doctors and pediatricians in the region's municipalities. We rely on the use case described to propose a model for a conversational agent-based access modality. The model proposed allows us to lay the foundation for more advanced chatbot-like implementations which will use also alternative input modalities, such as voice-based communication.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Barbera, David La; Soprano, Michael; Roitero, Kevin; Maddalena, Eddy; Mizzaro, Stefano
Fact-Checking at Scale with Crowdsourcing: Experiments and Lessons Learned Proceedings Article
In: Nardini, Franco Maria; Tonelotto, Nicola; Faggioli, Guglielmo; Ferrara, Antonio (Ed.): Proceedings of the 13th Italian Information Retrieval Workshop, pp. 85–90, CEUR-WS.org, Pisa, Italy, 2023.
@inproceedings{DBLP:conf/iir/BarberaSRMM23,
title = {Fact-Checking at Scale with Crowdsourcing: Experiments and Lessons Learned},
author = {La Barbera, David and Soprano, Michael and Roitero, Kevin and Maddalena, Eddy and Mizzaro, Stefano},
editor = {Nardini, Franco Maria and Tonelotto, Nicola and Faggioli, Guglielmo and Ferrara, Antonio},
url = {https://ceur-ws.org/Vol-3448/paper-18.pdf},
year = {2023},
date = {2023-08-26},
urldate = {2023-08-15},
booktitle = {Proceedings of the 13th Italian Information Retrieval Workshop},
volume = {3448},
pages = {85--90},
publisher = {CEUR-WS.org},
address = {Pisa, Italy},
series = {CEUR Workshop Proceedings},
abstract = {In this paper, we present our journey in exploring the use of crowdsourcing for fact-checking. We discuss our early experiments aimed towards the identification of the best possible setting for misinformation assessment using crowdsourcing. Our results indicate that the crowd can effectively address misinformation at scale, showing some degree of correlation with experts. We also highlight the influence of worker background on the quality of truthfulness assessments.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Roitero, Kevin; Barbera, David La; Soprano, Michael; Demartini, Gianluca; Mizzaro, Stefano; Sakai, Tetsuya
How Many Crowd Workers Do I Need? On Statistical Power When Crowdsourcing Relevance Judgments Journal Article
In: ACM Transactions on Information Systems, 2023, ISSN: 1046-8188, (Journal Ranks: Journal Citation Reports (JCR) Q1 (2021), Scimago (SJR) Q1 (2021)).
@article{10.1145/3597201,
title = {How Many Crowd Workers Do I Need? On Statistical Power When Crowdsourcing Relevance Judgments},
author = {Roitero, Kevin and La Barbera, David and Soprano, Michael and Demartini, Gianluca and Mizzaro, Stefano and Sakai, Tetsuya},
doi = {10.1145/3597201},
issn = {1046-8188},
year = {2023},
date = {2023-08-18},
urldate = {2023-01-01},
journal = {ACM Transactions on Information Systems},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
abstract = {To scale the size of Information Retrieval collections, crowdsourcing has become a common way to collect relevance judgments at scale. Crowdsourcing experiments usually employ 100-10,000 workers, but such a number is often decided in a heuristic way. The downside is that the resulting dataset does not have any guarantee of meeting predefined statistical requirements as, for example, have enough statistical power to be able to distinguish in a statistically significant way between the relevance of two documents. We propose a methodology adapted from literature on sound topic set size design, based on t-test and ANOVA, which aims at guaranteeing the resulting dataset to meet a predefined set of statistical requirements. We validate our approach on several public datasets. Our results show that we can reliably estimate the recommended number of workers needed to achieve statistical power, and that such estimation is dependent on the topic, while the effect of the relevance scale is limited. Furthermore, we found that such estimation is dependent on worker features such as agreement. Finally, we describe a set of practical estimation strategies that can be used to estimate the worker set size, and we also provide results on the estimation of document set sizes.},
note = {Journal Ranks: Journal Citation Reports (JCR) Q1 (2021), Scimago (SJR) Q1 (2021)},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Xie, Haoyu; Maddalena, Eddy; Qarout, Rehab; Checco, Alessandro
The Dark Side of Recruitment in Crowdsourcing: Ethics and Transparency in Micro-Task Marketplaces Journal Article
In: Computer Supported Cooperative Work (CSCW), vol. 32, no. 3, pp. 439-474, 2023, ISSN: 1573-7551.
@article{Xie2023b,
title = {The Dark Side of Recruitment in Crowdsourcing: Ethics and Transparency in Micro-Task Marketplaces},
author = {Xie, Haoyu and Maddalena, Eddy and Qarout, Rehab and Checco, Alessandro},
url = {https://doi.org/10.1007/s10606-023-09464-9},
doi = {10.1007/s10606-023-09464-9},
issn = {1573-7551},
year = {2023},
date = {2023-07-28},
urldate = {2023-09-01},
journal = {Computer Supported Cooperative Work (CSCW)},
volume = {32},
number = {3},
pages = {439--474},
abstract = {Micro-task crowdsourcing marketplaces like Figure Eight (F8) connect a large pool of workers to employers through a single online platform, by aggregating multiple crowdsourcing platforms (channels) under a unique system. This paper investigates the F8 channels' demographic distribution and reward schemes by analysing more than 53k crowdsourcing tasks over four years, collecting survey data and scraping marketplace metadata. We reveal an heterogeneous per-channel demographic distribution, and an opaque channel commission scheme, that varies over time and is not communicated to the employer when launching a task: workers often will receive a smaller payment than expected by the employer. In addition, the impact of channel commission schemes on the relationship between requesters and crowdworkers is explored. These observations uncover important issues on ethics, reliability and transparency of crowdsourced experiment when using this kind of marketplaces, especially for academic research.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Maddalena, Eddy; Ibáñez, Luis-Daniel; Reeves, Neal; Simperl, Elena
Qrowdsmith: Enhancing Paid Microtask Crowdsourcing with Gamification and Furtherance Incentives Journal Article
In: ACM Trans. Intell. Syst. Technol., 2023, ISSN: 2157-6904, (Just Accepted).
@article{10.1145/3604940,
  author    = {Eddy Maddalena and Luis-Daniel Ibáñez and Neal Reeves and Elena Simperl},
  title     = {Qrowdsmith: Enhancing Paid Microtask Crowdsourcing with Gamification and Furtherance Incentives},
  journal   = {ACM Trans. Intell. Syst. Technol.},
  year      = {2023},
  date      = {2023-06-01},
  issn      = {2157-6904},
  doi       = {10.1145/3604940},
  url       = {https://doi.org/10.1145/3604940},
  publisher = {Association for Computing Machinery},
  address   = {New York, NY, USA},
  abstract  = {Microtask crowdsourcing platforms are social intelligence systems in which volunteers, called crowdworkers, complete small, repetitive tasks in return for a small fee. Beyond payments, task requesters are considering non-monetary incentives such as points, badges and other gamified elements to increase performance and improve crowdworker experience. In this paper, we present Qrowdsmith, a platform for gamifying microtask crowdsourcing. To design the system, we explore empirically a range of gamified and financial incentives and analyse their impact on how efficient, effective, and reliable the results are. To maintain participation over time and save costs, we propose furtherance incentives, which are offered to crowdworkers to encourage additional contributions in addition to the fee agreed upfront. In a series of controlled experiments we find that while gamification can work as furtherance incentives, it impacts negatively on crowdworkers performance, both in terms of the quantity and quality of work, as compared to a baseline where they can continue to contribute voluntarily. Gamified incentives are also less effective than paid bonus equivalents. Our results contribute to the understanding of how best to encourage engagement in microtask crowdsourcing activities, and design better crowd intelligence systems.},
  note      = {Just Accepted},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {article}
}
Ceolin, Davide; Roitero, Kevin; Guo, Furong
Predicting Crowd Workers Performance: An Information Quality Case Proceedings Article
In: Garrigós, Irene; Rodríguez, Juan Manuel Murillo; Wimmer, Manuel (Ed.): Web Engineering, pp. 75–90, Springer Nature Switzerland, Cham, 2023, ISBN: 978-3-031-34444-2.
@inproceedings{10.1007/978-3-031-34444-2_6,
title = {Predicting Crowd Workers Performance: An Information Quality Case},
author = {Ceolin, Davide and Roitero, Kevin and Guo, Furong},
editor = {Garrigós, Irene and Murillo Rodríguez, Juan Manuel and Wimmer, Manuel},
doi = {10.1007/978-3-031-34444-2_6},
isbn = {978-3-031-34444-2},
year = {2023},
date = {2023-01-01},
booktitle = {Web Engineering},
pages = {75--90},
publisher = {Springer Nature Switzerland},
address = {Cham},
abstract = {Supervised machine learning tasks require human-labeled data. Crowdsourcing allows scaling up the labeling process, but the quality of the labels obtained can vary. To address this limitation, we propose methods for predicting label quality based on worker trajectories, i.e., on the sequence of documents workers explore during their crowdsourcing tasks. Trajectories represent a lightweight and non-intrusive form of worker behavior signal. We base our analysis on previously collected datasets composed of thousands of assessment data records including information such as workers' trajectories, workers' assessments, and experts' assessments. We model such behavior sequences as embeddings, to facilitate their management. Then, we: (1) use supervised methods to predict worker performance using a given ground truth; (2) perform an unsupervised analysis to provide insight into crowdsourcing quality when no gold standard is available. We test several supervised approaches which all beat the baseline we propose. Also, we identify significant differences between trajectory clusters in terms of assessments and worker performance. The trajectory-based analysis is a promising direction for non-intrusive worker performance evaluation.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Roitero, Kevin; Portelli, Beatrice; Serra, Giuseppe; Mea, Vincenzo Della; Mizzaro, Stefano; Cerro, Gianni; Vitelli, Michele; Molinara, Mario
Detection of Wastewater Pollution Through Natural Language Generation With a Low-Cost Sensing Platform Journal Article
In: IEEE Access, vol. 11, pp. 50272–50284, 2023, ISSN: 2169-3536.
@article{10129181,
title = {Detection of Wastewater Pollution Through Natural Language Generation With a Low-Cost Sensing Platform},
author = {Roitero, Kevin and Portelli, Beatrice and Serra, Giuseppe and Della Mea, Vincenzo and Mizzaro, Stefano and Cerro, Gianni and Vitelli, Michele and Molinara, Mario},
doi = {10.1109/ACCESS.2023.3277535},
issn = {2169-3536},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {IEEE Access},
volume = {11},
pages = {50272--50284},
abstract = {The detection of contaminants in several environments (e.g., air, water, sewage systems) is of paramount importance to protect people and predict possible dangerous circumstances. Most works do this using classical Machine Learning tools that act on the acquired measurement data. This paper introduces two main elements: a low-cost platform to acquire, pre-process, and transmit data to classify contaminants in wastewater; and a novel classification approach to classify contaminants in wastewater, based on deep learning and the transformation of raw sensor data into natural language metadata. The proposed solution presents clear advantages against state-of-the-art systems in terms of higher effectiveness and reasonable efficiency. The main disadvantage of the proposed approach is that it relies on knowing the injection time, i.e., the instant in time when the contaminant is injected into the wastewater. For this reason, the developed system also includes a finite state machine tool able to infer the exact time instant when the substance is injected. The entire system is presented and discussed in detail. Furthermore, several variants of the proposed processing technique are also presented to assess the sensitivity to the number of used samples and the corresponding promptness/computational burden of the system. The lowest accuracy obtained by our technique is 91.4%, which is significantly higher than the 81.0% accuracy reached by the best baseline method.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Amigó, Enrique; Deldjoo, Yashar; Mizzaro, Stefano; Bellogín, Alejandro
A unifying and general account of fairness measurement in recommender systems Journal Article
In: Information Processing & Management, vol. 60, no. 1, pp. 103115, 2023, ISSN: 0306-4573.
@article{AMIGO2023103115,
title = {A unifying and general account of fairness measurement in recommender systems},
author = {Amigó, Enrique and Deldjoo, Yashar and Mizzaro, Stefano and Bellogín, Alejandro},
url = {https://www.sciencedirect.com/science/article/pii/S0306457322002163},
doi = {10.1016/j.ipm.2022.103115},
issn = {0306-4573},
year = {2023},
date = {2023-01-01},
journal = {Information Processing \& Management},
volume = {60},
number = {1},
pages = {103115},
abstract = {Fairness is fundamental to all information access systems, including recommender systems. However, the landscape of fairness definition and measurement is quite scattered with many competing definitions that are partial and often incompatible. There is much work focusing on specific – and different – notions of fairness and there exist dozens of metrics of fairness in the literature, many of them redundant and most of them incompatible. In contrast, to our knowledge, there is no formal framework that covers all possible variants of fairness and allows developers to choose the most appropriate variant depending on the particular scenario. In this paper, we aim to define a general, flexible, and parameterizable framework that covers a whole range of fairness evaluation possibilities. Instead of modeling the metrics based on an abstract definition of fairness, the distinctive feature of this study compared to the current state of the art is that we start from the metrics applied in the literature to obtain a unified model by generalization. The framework is grounded on a general work hypothesis: interpreting the space of users and items as a probabilistic sample space, two fundamental measures in information theory (Kullback–Leibler Divergence and Mutual Information) can capture the majority of possible scenarios for measuring fairness on recommender system outputs. In addition, earlier research on fairness in recommender systems could be viewed as single-sided, trying to optimize some form of equity across either user groups or provider/procurer groups, without considering the user/item space in conjunction, thereby overlooking/disregarding the interplay between user and item groups. Instead, our framework includes the notion of statistical independence between user and item groups. We finally validate our approach experimentally on both synthetic and real data according to a wide range of state-of-the-art recommendation algorithms and real-world data sets, showing that with our framework we can measure fairness in a general, uniform, and meaningful way.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Amigó, Enrique; Gonzalo, Julio; Mizzaro, Stefano
What is My Problem? Identifying Formal Tasks and Metrics in Data Mining on the Basis of Measurement Theory Journal Article
In: IEEE Transactions on Knowledge and Data Engineering, vol. 35, no. 2, pp. 2147–2157, 2023.
@article{9528028,
title = {What is My Problem? Identifying Formal Tasks and Metrics in Data Mining on the Basis of Measurement Theory},
author = {Amigó, Enrique and Gonzalo, Julio and Mizzaro, Stefano},
doi = {10.1109/TKDE.2021.3109823},
year = {2023},
date = {2023-01-01},
journal = {IEEE Transactions on Knowledge and Data Engineering},
volume = {35},
number = {2},
pages = {2147--2157},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
2022
Brand, Erik; Roitero, Kevin; Soprano, Michael; Rahimi, Afshin; Demartini, Gianluca
A Neural Model to Jointly Predict and Explain Truthfulness of Statements Journal Article
In: J. Data and Information Quality, 2022, ISSN: 1936-1955, (Just Accepted).
@article{10.1145/3546917,
  author    = {Erik Brand and Kevin Roitero and Michael Soprano and Afshin Rahimi and Gianluca Demartini},
  title     = {A Neural Model to Jointly Predict and Explain Truthfulness of Statements},
  journal   = {J. Data and Information Quality},
  year      = {2022},
  date      = {2022-05-01},
  issn      = {1936-1955},
  doi       = {10.1145/3546917},
  url       = {https://doi.org/10.1145/3546917},
  publisher = {Association for Computing Machinery},
  address   = {New York, NY, USA},
  abstract  = {Automated fact-checking (AFC) systems exist to combat disinformation, however their complexity usually makes them opaque to the end user, making it difficult to foster trust in the system. In this paper, we introduce the E-BART model with the hope of making progress on this front. E-BART is able to provide a veracity prediction for a claim, and jointly generate a human-readable explanation for this decision. We show that E-BART is competitive with the state-of-the-art on the e-FEVER and e-SNLI tasks. In addition, we validate the joint-prediction architecture by showing 1) that generating explanations does not significantly impede the model from performing well in its main task of veracity prediction, and 2) that predicted veracity and explanations are more internally coherent when generated jointly than separately. We also calibrate the E-BART model, allowing the output of the final model be correctly interpreted as the confidence of correctness. Finally, we also conduct and extensive human evaluation on the impact of generated explanations and observe that: explanations increase human ability to spot misinformation and make people more skeptical about claims, and explanations generated by E-BART are competitive with ground truth explanations.},
  note      = {Just Accepted},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {article}
}
Qu, Yunke; Barbera, David La; Roitero, Kevin; Mizzaro, Stefano; Spina, Damiano; Demartini, Gianluca
Combining Human and Machine Confidence in Truthfulness Assessment Journal Article
In: J. Data and Information Quality, 2022, ISSN: 1936-1955, (Just Accepted).
@article{10.1145/3546916,
title = {Combining Human and Machine Confidence in Truthfulness Assessment},
author = {Qu, Yunke and La Barbera, David and Roitero, Kevin and Mizzaro, Stefano and Spina, Damiano and Demartini, Gianluca},
url = {https://doi.org/10.1145/3546916},
doi = {10.1145/3546916},
issn = {1936-1955},
year = {2022},
date = {2022-05-01},
journal = {J. Data and Information Quality},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
abstract = {Automatically detecting online misinformation at scale is a challenging and interdisciplinary problem. Deciding what is to be considered truthful information is sometimes controversial and difficult also for educated experts. As the scale of the problem increases, human-in-the-loop approaches to truthfulness that combine both the scalability of machine learning (ML) and the accuracy of human contributions have been considered. In this work we look at the potential to automatically combine machine-based systems with human-based systems. The former exploit supervised ML approaches; the latter involve either crowd workers (i.e., human non-experts) or human experts. Since both ML and crowdsourcing approaches can produce a score indicating the level of confidence on their truthfulness judgments (either algorithmic or self-reported, respectively), we address the question of whether it is feasible to make use of such confidence scores to effectively and efficiently combine three approaches: (i) machine-based methods; (ii) crowd workers, and (iii) human experts. The three approaches differ significantly as they range from available, cheap, fast, scalable, but less accurate to scarce, expensive, slow, not scalable, but highly accurate.},
note = {Just Accepted},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Soprano, Michael; Roitero, Kevin; Bona, Francesco Bombassei De; Mizzaro, Stefano
Crowd_Frame: A Simple and Complete Framework to Deploy Complex Crowdsourcing Tasks Off-the-Shelf Proceedings Article
In: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining, pp. 1605–1608, Association for Computing Machinery, Virtual Event, AZ, USA, 2022, ISBN: 9781450391320.
@inproceedings{conference-paper-wsdm2022,
title = {Crowd\_Frame: A Simple and Complete Framework to Deploy Complex Crowdsourcing Tasks Off-the-Shelf},
author = {Soprano, Michael and Roitero, Kevin and Bombassei De Bona, Francesco and Mizzaro, Stefano},
doi = {10.1145/3488560.3502182},
isbn = {9781450391320},
year = {2022},
date = {2022-01-01},
booktitle = {Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining},
pages = {1605--1608},
publisher = {Association for Computing Machinery},
address = {Virtual Event, AZ, USA},
series = {WSDM '22},
abstract = {Due to their relatively low cost and ability to scale, crowdsourcing based approaches are widely used to collect a large amount of human annotated data. To this aim, multiple crowdsourcing platforms exist, where requesters can upload tasks and workers can carry them out and obtain payment in return. Such platforms share a task design and deploy workflow that is often counter-intuitive and cumbersome. To address this issue, we propose Crowd\_Frame, a simple and complete framework which allows to develop and deploy diverse types of complex crowdsourcing tasks in an easy and customizable way. We show the abilities of the proposed framework and we make it available to researchers and practitioners.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Roitero, Kevin; Checco, Alessandro; Mizzaro, Stefano; Demartini, Gianluca
Preferences on a Budget: Prioritizing Document Pairs When Crowdsourcing Relevance Judgments Proceedings Article
In: Proceedings of the ACM Web Conference 2022, pp. 319–327, Association for Computing Machinery, Virtual Event, Lyon, France, 2022, ISBN: 9781450390965.
@inproceedings{10.1145/3485447.3511960,
title = {Preferences on a Budget: Prioritizing Document Pairs When Crowdsourcing Relevance Judgments},
author = {Kevin Roitero and Alessandro Checco and Stefano Mizzaro and Gianluca Demartini},
url = {https://doi.org/10.1145/3485447.3511960},
doi = {10.1145/3485447.3511960},
isbn = {9781450390965},
year = {2022},
date = {2022-01-01},
booktitle = {Proceedings of the ACM Web Conference 2022},
pages = {319--327},
publisher = {Association for Computing Machinery},
address = {Virtual Event, Lyon, France},
series = {WWW '22},
abstract = {In Information Retrieval (IR) evaluation, preference judgments are collected by presenting to the assessors a pair of documents and asking them to select which of the two, if any, is the most relevant. This is an alternative to the classic relevance judgment approach, in which human assessors judge the relevance of a single document on a scale; such an alternative allows to make relative rather than absolute judgments of relevance. While preference judgments are easier for human assessors to perform, the number of possible document pairs to be judged is usually so high that it makes it unfeasible to judge them all. Thus, following a similar idea to pooling strategies for single document relevance judgments where the goal is to sample the most useful documents to be judged, in this work we focus on analyzing alternative ways to sample document pairs to judge, in order to maximize the value of a fixed number of preference judgments that can feasibly be collected. Such value is defined as how well we can evaluate IR systems given a budget, that is, a fixed number of human preference judgments that may be collected. By relying on several datasets featuring relevance judgments gathered by means of experts and crowdsourcing, we experimentally compare alternative strategies to select document pairs and show how different strategies lead to different IR evaluation result quality levels. Our results show that, by using the appropriate procedure, it is possible to achieve good IR evaluation results with a limited number of preference judgments, thus confirming the feasibility of using preference judgments to create IR evaluation collections.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Barbera, David La; Roitero, Kevin; Mackenzie, Joel; Spina, Damiano; Demartini, Gianluca; Mizzaro, Stefano
BUM at CheckThat! 2022: A Composite Deep Learning Approach to Fake News Detection using Evidence Retrieval Proceedings Article
In: Faggioli, Guglielmo; Ferro, Nicola; Hanbury, Allan; Potthast, Martin (Ed.): Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum, Bologna, Italy, 2022.
@inproceedings{clef-checkthat:2022:task3:La_Barbera_BUM,
title = {BUM at CheckThat! 2022: A Composite Deep Learning Approach to Fake News Detection using Evidence Retrieval},
author = {David La Barbera and Kevin Roitero and Joel Mackenzie and Damiano Spina and Gianluca Demartini and Stefano Mizzaro},
editor = {Guglielmo Faggioli and Nicola Ferro and Allan Hanbury and Martin Potthast},
year = {2022},
date = {2022-01-01},
booktitle = {Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum},
address = {Bologna, Italy},
series = {CLEF~'2022},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Draws, Tim; Barbera, David La; Soprano, Michael; Roitero, Kevin; Ceolin, Davide; Checco, Alessandro; Mizzaro, Stefano
The Effects of Crowd Worker Biases in Fact-Checking Tasks Proceedings Article
In: 2022 ACM Conference on Fairness, Accountability, and Transparency, pp. 2114–2124, Association for Computing Machinery, Seoul, Republic of Korea, 2022, ISBN: 9781450393522.
@inproceedings{10.1145/3531146.3534629,
title = {The Effects of Crowd Worker Biases in Fact-Checking Tasks},
author = {Tim Draws and David La Barbera and Michael Soprano and Kevin Roitero and Davide Ceolin and Alessandro Checco and Stefano Mizzaro},
url = {https://doi.org/10.1145/3531146.3534629},
doi = {10.1145/3531146.3534629},
isbn = {9781450393522},
year = {2022},
date = {2022-01-01},
booktitle = {2022 ACM Conference on Fairness, Accountability, and Transparency},
pages = {2114--2124},
publisher = {Association for Computing Machinery},
address = {Seoul, Republic of Korea},
series = {FAccT '22},
abstract = {Due to the increasing amount of information shared online every day, the need for sound and reliable ways of distinguishing between trustworthy and non-trustworthy information is as present as ever. One technique for performing fact-checking at scale is to employ human intelligence in the form of crowd workers. Although earlier work has suggested that crowd workers can reliably identify misinformation, cognitive biases of crowd workers may reduce the quality of truthfulness judgments in this context. We performed a systematic exploratory analysis of publicly available crowdsourced data to identify a set of potential systematic biases that may occur when crowd workers perform fact-checking tasks. Following this exploratory study, we collected a novel data set of crowdsourced truthfulness judgments to validate our hypotheses. Our findings suggest that workers generally overestimate the truthfulness of statements and that different individual characteristics (i.e., their belief in science) and cognitive biases (i.e., the affect heuristic and overconfidence) can affect their annotations. Interestingly, we find that, depending on the general judgment tendencies of workers, their biases may sometimes lead to more accurate judgments.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Ceschia, Sara; Roitero, Kevin; Demartini, Gianluca; Mizzaro, Stefano; Gaspero, Luca Di; Schaerf, Andrea
Task design in complex crowdsourcing experiments: Item assignment optimization Journal Article
In: Computers & Operations Research, pp. 105995, 2022, ISSN: 0305-0548.
@article{CESCHIA2022105995,
title = {Task design in complex crowdsourcing experiments: Item assignment optimization},
author = {Sara Ceschia and Kevin Roitero and Gianluca Demartini and Stefano Mizzaro and Luca Di Gaspero and Andrea Schaerf},
url = {https://www.sciencedirect.com/science/article/pii/S0305054822002295},
doi = {10.1016/j.cor.2022.105995},
issn = {0305-0548},
year = {2022},
date = {2022-01-01},
journal = {Computers & Operations Research},
pages = {105995},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Ceolin, Davide; Primiero, Giuseppe; Soprano, Michael; Wielemaker, Jan
Transparent Assessment of Information Quality of Online Reviews Using Formal Argumentation Theory Journal Article
In: Information Systems, vol. 110, pp. 102107, 2022, ISSN: 0306-4379, (Journal Ranks: Journal Citation Reports (JCR) Q2 (2021), Scimago (SJR) Q1 (2021)).
@article{CEOLIN2022102107,
  title     = {Transparent Assessment of Information Quality of Online Reviews Using Formal Argumentation Theory},
  author    = {Davide Ceolin and Giuseppe Primiero and Michael Soprano and Jan Wielemaker},
  doi       = {10.1016/j.is.2022.102107},
  issn      = {0306-4379},
  year      = {2022},
  date      = {2022-01-01},
  journal   = {Information Systems},
  volume    = {110},
  pages     = {102107},
  abstract  = {Review scores collect users’ opinions in a simple and intuitive manner. However, review scores are also easily manipulable, hence they are often accompanied by explanations. A substantial amount of research has been devoted to ascertaining the quality of reviews, to identify the most useful and authentic scores through explanation analysis. In this paper, we advance the state of the art in review quality analysis. We introduce a rating system to identify review arguments and to define an appropriate weighted semantics through formal argumentation theory. We introduce an algorithm to construct a corresponding graph, based on a selection of weighted arguments, their semantic distance, and the supported ratings. We also provide an algorithm to identify the model of such an argumentation graph, maximizing the overall weight of the admitted nodes and edges. We evaluate these contributions on the Amazon review dataset by McAuley et al. (2015), by comparing the results of our argumentation assessment with the upvotes received by the reviews. Also, we deepen the evaluation by crowdsourcing a multidimensional assessment of reviews and comparing it to the argumentation assessment. Lastly, we perform a user study to evaluate the explainability of our method, i.e., to test whether the automated method we use to assess reviews is understandable by humans. Our method achieves two goals: (1) it identifies reviews that are considered useful, comprehensible, and complete by online users, and does so in an unsupervised manner, and (2) it provides an explanation of quality assessments.},
  note      = {Journal Ranks: Journal Citation Reports (JCR) Q2 (2021), Scimago (SJR) Q1 (2021)},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {article}
}
Amigó, Enrique; Mizzaro, Stefano; Spina, Damiano
Ranking Interruptus: When Truncated Rankings Are Better and How to Measure That Proceedings Article
In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 588–598, Association for Computing Machinery, New York, NY, USA, 2022, ISBN: 9781450387323.
@inproceedings{10.1145/3477495.3532051,
title = {Ranking Interruptus: When Truncated Rankings Are Better and How to Measure That},
author = {Enrique Amigó and Stefano Mizzaro and Damiano Spina},
url = {https://doi.org/10.1145/3477495.3532051},
doi = {10.1145/3477495.3532051},
isbn = {9781450387323},
year = {2022},
date = {2022-01-01},
booktitle = {Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval},
pages = {588--598},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
series = {SIGIR '22},
abstract = {Most of information retrieval effectiveness evaluation metrics assume that systems appending irrelevant documents at the bottom of the ranking are as effective as (or not worse than) systems that have a stopping criteria to 'truncate' the ranking at the right position to avoid retrieving those irrelevant documents at the end. It can be argued, however, that such truncated rankings are more useful to the end user. It is thus important to understand how to measure retrieval effectiveness in this scenario. In this paper we provide both theoretical and experimental contributions. We first define formal properties to analyze how effectiveness metrics behave when evaluating truncated rankings. Our theoretical analysis shows that de-facto standard metrics do not satisfy desirable properties to evaluate truncated rankings: only Observational Information Effectiveness (OIE) – a metric based on Shannon's information theory – satisfies them all. We then perform experiments to compare several metrics on nine TREC datasets. According to our experimental results, the most appropriate metrics for truncated rankings are OIE and a novel extension of Rank-Biased Precision that adds a user effort factor penalizing the retrieval of irrelevant documents.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
2021
Brand, Erik; Roitero, Kevin; Soprano, Michael; Demartini, Gianluca
E-BART: Jointly Predicting and Explaining Truthfulness Proceedings Article
In: Augenstein, Isabelle; Papotti, Paolo; Wright, Dustin (Ed.): Proceedings of the 2021 Truth and Trust Online Conference (TTO 2021), Virtual, October 7-8, 2021, pp. 18–27, Hacks Hackers, 2021.
@inproceedings{conference-paper-tto-2021,
title = {E-BART: Jointly Predicting and Explaining Truthfulness},
author = {Erik Brand and Kevin Roitero and Michael Soprano and Gianluca Demartini},
editor = {Isabelle Augenstein and Paolo Papotti and Dustin Wright},
url = {https://truthandtrustonline.com/wp-content/uploads/2021/10/TTO2021_paper_16-1.pdf},
year = {2021},
date = {2021-01-01},
booktitle = {Proceedings of the 2021 Truth and Trust Online Conference (TTO 2021), Virtual, October 7-8, 2021},
pages = {18--27},
publisher = {Hacks Hackers},
abstract = {Automated fact-checking (AFC) systems exist to combat disinformation, however their complexity makes them opaque to the end user, making it difficult to foster trust. In this paper, we introduce the E-BART model with the hope of making progress on this front. E-BART is able to provide a veracity prediction for a claim, and jointly generate a human-readable explanation for this decision. We show that E-BART is competitive with the state-of-the-art on the e-FEVER and e-SNLI tasks. In addition, we validate the joint-prediction architecture by showing 1) that generating explanations does not significantly impede the model from performing well in its main task of veracity prediction, and 2) that predicted veracity and explanations are more internally coherent when generated jointly than separately. Finally, we also conduct human evaluations on the impact of generated explanations and observe that explanations increase human ability to spot misinformation and make people more skeptical about claims.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Roitero, Kevin; Soprano, Michael; Portelli, Beatrice; Luise, Massimiliano De; Spina, Damiano; Mea, Vincenzo Della; Serra, Giuseppe; Mizzaro, Stefano; Demartini, Gianluca
Can The Crowd Judge Truthfulness? A Longitudinal Study on Recent Misinformation About COVID-19 Journal Article
In: Personal and Ubiquitous Computing, 2021, ISSN: 1617-4917.
@article{journal-paper-puc-2021,
  title     = {Can The Crowd Judge Truthfulness? A Longitudinal Study on Recent Misinformation About COVID-19},
  author    = {Kevin Roitero and Michael Soprano and Beatrice Portelli and Massimiliano De Luise and Damiano Spina and Vincenzo Della Mea and Giuseppe Serra and Stefano Mizzaro and Gianluca Demartini},
  url       = {https://doi.org/10.1007/s00779-021-01604-6},
  doi       = {10.1007/s00779-021-01604-6},
  issn      = {1617-4917},
  year      = {2021},
  date      = {2021-01-01},
  journal   = {Personal and Ubiquitous Computing},
  abstract  = {Recently, the misinformation problem has been addressed with a crowdsourcing-based approach: to assess the truthfulness of a statement, instead of relying on a few experts, a crowd of non-expert is exploited. We study whether crowdsourcing is an effective and reliable method to assess truthfulness during a pandemic, targeting statements related to COVID-19, thus addressing (mis)information that is both related to a sensitive and personal issue and very recent as compared to when the judgment is done. In our experiments, crowd workers are asked to assess the truthfulness of statements, and to provide evidence for the assessments. Besides showing that the crowd is able to accurately judge the truthfulness of the statements, we report results on workers' behavior, agreement among workers, effect of aggregation functions, of scales transformations, and of workers background and bias. We perform a longitudinal study by re-launching the task multiple times with both novice and experienced workers, deriving important insights on how the behavior and quality change over time. Our results show that workers are able to detect and objectively categorize online (mis)information related to COVID-19; both crowdsourced and expert judgments can be transformed and aggregated to improve quality; worker background and other signals (e.g., source of information, behavior) impact the quality of the data. The longitudinal study demonstrates that the time-span has a major effect on the quality of the judgments, for both novice and experienced workers. Finally, we provide an extensive failure analysis of the statements misjudged by the crowd-workers.},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {article}
}
Soprano, Michael; Roitero, Kevin; Barbera, David La; Ceolin, Davide; Spina, Damiano; Mizzaro, Stefano; Demartini, Gianluca
The Many Dimensions of Truthfulness: Crowdsourcing Misinformation Assessments on a Multidimensional Scale Journal Article
In: Information Processing & Management, vol. 58, no 6, pp. 102710, 2021, ISSN: 0306-4573.
@article{journal-paper-ipm-2021,
title = {The Many Dimensions of Truthfulness: Crowdsourcing Misinformation Assessments on a Multidimensional Scale},
author = {Michael Soprano and Kevin Roitero and David La Barbera and Davide Ceolin and Damiano Spina and Stefano Mizzaro and Gianluca Demartini},
url = {https://www.sciencedirect.com/science/article/pii/S0306457321001941},
doi = {10.1016/j.ipm.2021.102710},
issn = {0306-4573},
year = {2021},
date = {2021-01-01},
journal = {Information Processing & Management},
volume = {58},
number = {6},
pages = {102710},
abstract = {Recent work has demonstrated the viability of using crowdsourcing as a tool for evaluating the truthfulness of public statements. Under certain conditions such as: (1) having a balanced set of workers with different backgrounds and cognitive abilities; (2) using an adequate set of mechanisms to control the quality of the collected data; and (3) using a coarse grained assessment scale, the crowd can provide reliable identification of fake news. However, fake news are a subtle matter: statements can be just biased (“cherrypicked”), imprecise, wrong, etc. and the unidimensional truth scale used in existing work cannot account for such differences. In this paper we propose a multidimensional notion of truthfulness and we ask the crowd workers to assess seven different dimensions of truthfulness selected based on existing literature: Correctness, Neutrality, Comprehensibility, Precision, Completeness, Speaker’s Trustworthiness, and Informativeness. We deploy a set of quality control mechanisms to ensure that the thousands of assessments collected on 180 publicly available fact-checked statements distributed over two datasets are of adequate quality, including a custom search engine used by the crowd workers to find web pages supporting their truthfulness assessments. A comprehensive analysis of crowdsourced judgments shows that: (1) the crowdsourced assessments are reliable when compared to an expert-provided gold standard; (2) the proposed dimensions of truthfulness capture independent pieces of information; (3) the crowdsourcing task can be easily learned by the workers; and (4) the resulting assessments provide a useful basis for a more complete estimation of statement truthfulness.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Ceolin, Davide; Primiero, Giuseppe; Wielemaker, Jan; Soprano, Michael
Assessing the Quality of Online Reviews Using Formal Argumentation Theory Proceedings Article
In: Brambilla, Marco; Chbeir, Richard; Frasincar, Flavius; Manolescu, Ioana (Ed.): Web Engineering, pp. 71–87, Springer International Publishing, Cham, 2021, ISBN: 978-3-030-74296-6.
@inproceedings{10.1007/978-3-030-74296-6_6,
  title     = {Assessing the Quality of Online Reviews Using Formal Argumentation Theory},
  author    = {Davide Ceolin and Giuseppe Primiero and Jan Wielemaker and Michael Soprano},
  editor    = {Marco Brambilla and Richard Chbeir and Flavius Frasincar and Ioana Manolescu},
  doi       = {10.1007/978-3-030-74296-6_6},
  isbn      = {978-3-030-74296-6},
  year      = {2021},
  date      = {2021-01-01},
  booktitle = {Web Engineering},
  pages     = {71--87},
  publisher = {Springer International Publishing},
  address   = {Cham},
  abstract  = {Review scores collect users' opinions in a simple and intuitive manner. However, review scores are also easily manipulable, hence they are often accompanied by explanations. A substantial amount of research has been devoted to ascertaining the quality of reviews, to identify the most useful and authentic scores through explanation analysis. In this paper, we advance the state of the art in review quality analysis. We introduce a rating system to identify review arguments and to define an appropriate weighted semantics through formal argumentation theory. We introduce an algorithm to construct a corresponding graph, based on a selection of weighted arguments, their semantic similarity, and the supported ratings. We provide an algorithm to identify the model of such an argumentation graph, maximizing the overall weight of the admitted nodes and edges. We evaluate these contributions on the Amazon review dataset by McAuley et al. [15], by comparing the results of our argumentation assessment with the upvotes received by the reviews. Also, we deepen the evaluation by crowdsourcing a multidimensional assessment of reviews and comparing it to the argumentation assessment. Lastly, we perform a user study to evaluate the explainability of our method. Our method achieves two goals: (1) it identifies reviews that are considered useful, comprehensible, truthful by online users and does so in an unsupervised manner, and (2) it provides an explanation of quality assessments.},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {inproceedings}
}
Qu, Yunke; Roitero, Kevin; Mizzaro, Stefano; Spina, Damiano; Demartini, Gianluca
Human-in-the-Loop Systems for Truthfulness: A Study of Human and Machine Confidence Proceedings Article
In: Augenstein, Isabelle; Papotti, Paolo; Wright, Dustin (Ed.): Proceedings of the 2021 Truth and Trust Online Conference (TTO 2021), Virtual, October 7-8, 2021, pp. 40–49, Hacks Hackers, 2021.
@inproceedings{DBLP:conf/tto/QuRMSD21,
title = {Human-in-the-Loop Systems for Truthfulness: A Study of Human and Machine Confidence},
author = {Yunke Qu and Kevin Roitero and Stefano Mizzaro and Damiano Spina and Gianluca Demartini},
editor = {Isabelle Augenstein and Paolo Papotti and Dustin Wright},
url = {https://truthandtrustonline.com/wp-content/uploads/2021/10/TTO2021_paper_29.pdf},
year = {2021},
date = {2021-01-01},
booktitle = {Proceedings of the 2021 Truth and Trust Online Conference (TTO 2021), Virtual, October 7-8, 2021},
pages = {40--49},
publisher = {Hacks Hackers},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Roitero, Kevin; Portelli, Beatrice; Popescu, Mihai Horia; Mea, Vincenzo Della
DiLBERT: Cheap Embeddings for Disease Related Medical NLP Journal Article
In: IEEE Access, vol. 9, pp. 159714-159723, 2021.
@article{9628010,
title = {DiLBERT: Cheap Embeddings for Disease Related Medical NLP},
author = {Kevin Roitero and Beatrice Portelli and Mihai Horia Popescu and Vincenzo Della Mea},
doi = {10.1109/ACCESS.2021.3131386},
year = {2021},
date = {2021-01-01},
journal = {IEEE Access},
volume = {9},
pages = {159714--159723},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Demartini, Gianluca; Roitero, Kevin; Mizzaro, Stefano
Managing Bias in Human-Annotated Data: Moving Beyond Bias Removal Journal Article
In: CoRR, vol. abs/2110.13504, 2021.
@article{DBLP:journals/corr/abs-2110-13504,
title = {Managing Bias in Human-Annotated Data: Moving Beyond Bias Removal},
author = {Gianluca Demartini and Kevin Roitero and Stefano Mizzaro},
url = {https://arxiv.org/abs/2110.13504},
eprint = {2110.13504},
eprinttype = {arXiv},
year = {2021},
date = {2021-01-01},
journal = {CoRR},
volume = {abs/2110.13504},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Conde-Sousa, Eduardo; Vale, João; Feng, Ming; Xu, Kele; Wang, Yin; Mea, Vincenzo Della; Barbera, David La; Montahaei, Ehsan; Baghshah, Mahdieh Soleymani; Turzynski, Andreas; Gildenblat, Jacob; Klaiman, Eldad; Hong, Yiyu; Aresta, Guilherme; Araújo, Teresa; Aguiar, Paulo; Eloy, Catarina; Polónia, António
HEROHE Challenge: assessing HER2 status in breast cancer without immunohistochemistry or in situ hybridization Miscellaneous
2021.
@misc{https://doi.org/10.48550/arxiv.2111.04738,
title = {HEROHE Challenge: assessing HER2 status in breast cancer without immunohistochemistry or in situ hybridization},
author = {Eduardo Conde-Sousa and João Vale and Ming Feng and Kele Xu and Yin Wang and Vincenzo Della Mea and David La Barbera and Ehsan Montahaei and Mahdieh Soleymani Baghshah and Andreas Turzynski and Jacob Gildenblat and Eldad Klaiman and Yiyu Hong and Guilherme Aresta and Teresa Araújo and Paulo Aguiar and Catarina Eloy and António Polónia},
url = {https://arxiv.org/abs/2111.04738},
doi = {10.48550/arXiv.2111.04738},
eprint = {2111.04738},
eprinttype = {arXiv},
year = {2021},
date = {2021-01-01},
publisher = {arXiv},
keywords = {},
pubstate = {published},
tppubtype = {misc}
}
Barbera, David La; Roitero, Kevin; Mizzaro, Stefano; Mea, Vincenzo Della; Valent, Francesca
A Software Simulator for Optimizing Ambulance Location and Response Time: A Preliminary Report Proceedings Article
In: 2021 IEEE International Conference on Digital Health (ICDH), pp. 209-211, 2021.
@inproceedings{9581242,
title = {A Software Simulator for Optimizing Ambulance Location and Response Time: A Preliminary Report},
author = {David La Barbera and Kevin Roitero and Stefano Mizzaro and Vincenzo Della Mea and Francesca Valent},
doi = {10.1109/ICDH52753.2021.00037},
year = {2021},
date = {2021-01-01},
booktitle = {2021 IEEE International Conference on Digital Health (ICDH)},
pages = {209--211},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
2020
Barbera, David La; Polónia, António; Roitero, Kevin; Conde-Sousa, Eduardo; Mea, Vincenzo Della
Detection of HER2 from Haematoxylin-Eosin Slides Through a Cascade of Deep Learning Classifiers via Multi-Instance Learning Journal Article
In: Journal of Imaging, vol. 6, no 9, 2020, ISSN: 2313-433X.
@article{labarberaher2,
  title     = {Detection of HER2 from Haematoxylin-Eosin Slides Through a Cascade of Deep Learning Classifiers via Multi-Instance Learning},
  author    = {David La Barbera and António Polónia and Kevin Roitero and Eduardo Conde-Sousa and Vincenzo Della Mea},
  doi       = {10.3390/jimaging6090082},
  issn      = {2313-433X},
  year      = {2020},
  date      = {2020-08-23},
  urldate   = {2020-08-23},
  journal   = {Journal of Imaging},
  volume    = {6},
  number    = {9},
  abstract  = {Breast cancer is the most frequently diagnosed cancer in woman. The correct identification of the HER2 receptor is a matter of major importance when dealing with breast cancer: an over-expression of HER2 is associated with aggressive clinical behaviour; moreover, HER2 targeted therapy results in a significant improvement in the overall survival rate. In this work, we employ a pipeline based on a cascade of deep neural network classifiers and multi-instance learning to detect the presence of HER2 from Haematoxylin-Eosin slides, which partly mimics the pathologist’s behaviour by first recognizing cancer and then evaluating HER2. Our results show that the proposed system presents a good overall effectiveness. Furthermore, the system design is prone to further improvements that can be easily deployed in order to increase the effectiveness score.},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {article}
}
Roitero, Kevin; Soprano, Michael; Fan, Shaoyang; Spina, Damiano; Mizzaro, Stefano; Demartini, Gianluca
Can The Crowd Identify Misinformation Objectively? The Effects of Judgment Scale and Assessor's Background Proceedings Article
In: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 439–448, Association for Computing Machinery, Virtual Event, China, 2020, ISBN: 9781450380164.
@inproceedings{10.1145/3397271.3401112,
title = {Can The Crowd Identify Misinformation Objectively? The Effects of Judgment Scale and Assessor's Background},
author = {Kevin Roitero and Michael Soprano and Shaoyang Fan and Damiano Spina and Stefano Mizzaro and Gianluca Demartini},
url = {https://doi.org/10.1145/3397271.3401112},
doi = {10.1145/3397271.3401112},
isbn = {9781450380164},
year = {2020},
date = {2020-01-01},
booktitle = {Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval},
pages = {439--448},
publisher = {Association for Computing Machinery},
address = {Virtual Event, China},
series = {SIGIR '20},
abstract = {Truthfulness judgments are a fundamental step in the process of fighting misinformation, as they are crucial to train and evaluate classifiers that automatically distinguish true and false statements. Usually such judgments are made by experts, like journalists for political statements or medical doctors for medical statements. In this paper, we follow a different approach and rely on (non-expert) crowd workers. This of course leads to the following research question: Can crowdsourcing be reliably used to assess the truthfulness of information and to create large-scale labeled collections for information credibility systems? To address this issue, we present the results of an extensive study based on crowdsourcing: we collect thousands of truthfulness assessments over two datasets, and we compare expert judgments with crowd judgments, expressed on scales with various granularity levels. We also measure the political bias and the cognitive background of the workers, and quantify their effect on the reliability of the data provided by the crowd.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Roitero, Kevin; Culpepper, J. Shane; Sanderson, Mark; Scholer, Falk; Mizzaro, Stefano
Fewer topics? A million topics? Both?! On topics subsets in test collections Journal Article
In: Inf. Retr. J., vol. 23, no 1, pp. 49–85, 2020.
@article{DBLP:journals/ir/RoiteroCSSM20,
title = {Fewer topics? A million topics? Both?! On topics subsets in test collections},
author = {Kevin Roitero and J. Shane Culpepper and Mark Sanderson and Falk Scholer and Stefano Mizzaro},
url = {https://doi.org/10.1007/s10791-019-09357-w},
doi = {10.1007/s10791-019-09357-w},
year = {2020},
date = {2020-01-01},
journal = {Inf. Retr. J.},
volume = {23},
number = {1},
pages = {49--85},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Han, Lei; Maddalena, Eddy; Checco, Alessandro; Sarasua, Cristina; Gadiraju, Ujwal; Roitero, Kevin; Demartini, Gianluca
Crowd Worker Strategies in Relevance Judgment Tasks Proceedings Article
In: Proceedings of the 13th International Conference on Web Search and Data Mining, pp. 241–249, Association for Computing Machinery, Houston, TX, USA, 2020, ISBN: 9781450368223.
@inproceedings{10.1145/3336191.3371857,
title = {Crowd Worker Strategies in Relevance Judgment Tasks},
author = {Lei Han and Eddy Maddalena and Alessandro Checco and Cristina Sarasua and Ujwal Gadiraju and Kevin Roitero and Gianluca Demartini},
url = {https://doi.org/10.1145/3336191.3371857},
doi = {10.1145/3336191.3371857},
isbn = {9781450368223},
year = {2020},
date = {2020-01-01},
booktitle = {Proceedings of the 13th International Conference on Web Search and Data Mining},
pages = {241--249},
publisher = {Association for Computing Machinery},
address = {Houston, TX, USA},
series = {WSDM '20},
abstract = {Crowdsourcing is a popular technique to collect large amounts of human-generated labels, such as relevance judgments used to create information retrieval (IR) evaluation collections. Previous research has shown how collecting high quality labels from a crowdsourcing platform can be challenging. Existing quality assurance techniques focus on answer aggregation or on the use of gold questions where ground-truth data allows to check for the quality of the responses.In this paper, we present qualitative and quantitative results, revealing how different crowd workers adopt different work strategies to complete relevance judgment tasks efficiently and their consequent impact on quality. We delve into the techniques and tools that highly experienced crowd workers use to be more efficient in completing crowdsourcing micro-tasks. To this end, we use both qualitative results from worker interviews and surveys, as well as the results of a data-driven study of behavioral log data (i.e., clicks, keystrokes and keyboard shortcuts) collected from crowd workers performing relevance judgment tasks. Our results highlight the presence of frequently used shortcut patterns that can speed-up task completion, thus increasing the hourly wage of efficient workers. We observe how crowd work experiences result in different types of working strategies, productivity levels, quality and diversity of the crowdsourced judgments.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Roitero, Kevin; Brunello, Andrea; Serra, Giuseppe; Mizzaro, Stefano
Effectiveness evaluation without human relevance judgments: A systematic analysis of existing methods and of their combinations Journal Article
In: Information Processing & Management, vol. 57, no 2, pp. 102149, 2020, ISSN: 0306-4573.
@article{ROITERO2020102149,
title = {Effectiveness evaluation without human relevance judgments: A systematic analysis of existing methods and of their combinations},
author = {Kevin Roitero and Andrea Brunello and Giuseppe Serra and Stefano Mizzaro},
url = {http://www.sciencedirect.com/science/article/pii/S030645731930192X},
doi = {10.1016/j.ipm.2019.102149},
issn = {0306-4573},
year = {2020},
date = {2020-01-01},
journal = {Information Processing \& Management},
volume = {57},
number = {2},
pages = {102149},
abstract = {In test collection based evaluation of retrieval effectiveness, it has been suggested to completely avoid using human relevance judgments. Although several methods have been proposed, their accuracy is still limited. In this paper we present two overall contributions. First, we provide a systematic comparison of all the most widely adopted previous approaches on a large set of 14 TREC collections. We aim at analyzing the methods in a homogeneous and complete way, in terms of the accuracy measures used as well as in terms of the datasets selected, showing that considerably different results may be achieved considering different methods, datasets, and measures. Second, we study the combination of such methods, which, to the best of our knowledge, has not been investigated so far. Our experimental results show that simple combination strategies based on data fusion techniques are usually not effective and even harmful. However, some more sophisticated solutions, based on machine learning, are indeed effective and often outperform all individual methods. Moreover, they are more stable, as they show a smaller variation across datasets. Our results have the practical implication that, when trying to automatically evaluate retrieval effectiveness, researchers should not use a single method, but a (machine-learning based) combination of them.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Roitero, Kevin; Soprano, Michael; Portelli, Beatrice; Spina, Damiano; Mea, Vincenzo Della; Serra, Giuseppe; Mizzaro, Stefano; Demartini, Gianluca
The COVID-19 Infodemic: Can the Crowd Judge Recent Misinformation Objectively? Proceedings Article
In: Proceedings of the 29th ACM International Conference on Information and Knowledge Management (CIKM2020). Galway, Ireland (Online). October 19-23, 2020. Conference Rank: GGS A+, Core A, pp. 1305–1314, Association for Computing Machinery, Virtual Event, Ireland, 2020, ISBN: 9781450368599.
@inproceedings{conference-paper-cikm2020,
title = {The {COVID-19} Infodemic: Can the Crowd Judge Recent Misinformation Objectively?},
author = {Kevin Roitero and Michael Soprano and Beatrice Portelli and Damiano Spina and Vincenzo Della Mea and Giuseppe Serra and Stefano Mizzaro and Gianluca Demartini},
url = {https://doi.org/10.1145/3340531.3412048},
doi = {10.1145/3340531.3412048},
isbn = {9781450368599},
year = {2020},
date = {2020-01-01},
booktitle = {Proceedings of the 29th ACM International Conference on Information and Knowledge Management (CIKM2020)},
note = {Galway, Ireland (Online). October 19-23, 2020. Conference Rank: GGS A+, Core A},
pages = {1305--1314},
publisher = {Association for Computing Machinery},
address = {Virtual Event, Ireland},
series = {CIKM '20},
abstract = {Misinformation is an ever increasing problem that is difficult to solve for the research community and has a negative impact on the society at large. Very recently, the problem has been addressed with a crowdsourcing-based approach to scale up labeling efforts: to assess the truthfulness of a statement, instead of relying on a few experts, a crowd of (non-expert) judges is exploited. We follow the same approach to study whether crowdsourcing is an effective and reliable method to assess statements truthfulness during a pandemic. We specifically target statements related to the COVID-19 health emergency, that is still ongoing at the time of the study and has arguably caused an increase of the amount of misinformation that is spreading online (a phenomenon for which the term "infodemic" has been used). By doing so, we are able to address (mis)information that is both related to a sensitive and personal issue like health and very recent as compared to when the judgment is done: two issues that have not been analyzed in related work.In our experiment, crowd workers are asked to assess the truthfulness of statements, as well as to provide evidence for the assessments as a URL and a text justification. Besides showing that the crowd is able to accurately judge the truthfulness of the statements, we also report results on many different aspects, including: agreement among workers, the effect of different aggregation functions, of scales transformations, and of workers background / bias. We also analyze workers behavior, in terms of queries submitted, URLs found / selected, text justifications, and other behavioral data like clicks and mouse actions collected by means of an ad hoc logger.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Roitero, Kevin; Soprano, Michael; Fan, Shaoyang; Spina, Damiano; Mizzaro, Stefano; Demartini, Gianluca
Can The Crowd Identify Misinformation Objectively? The Effects of Judgment Scale and Assessor's Background Proceedings Article
In: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2020). Xi’an, China (Online). July 25-30, 2020. Conference Rank: GGS A++, Core A*, pp. 439–448, Association for Computing Machinery, Virtual Event, China, 2020, ISBN: 9781450380164.
@inproceedings{conference-paper-sigir2020,
title = {Can The Crowd Identify Misinformation Objectively? The Effects of Judgment Scale and Assessor's Background},
author = {Kevin Roitero and Michael Soprano and Shaoyang Fan and Damiano Spina and Stefano Mizzaro and Gianluca Demartini},
url = {https://doi.org/10.1145/3397271.3401112},
doi = {10.1145/3397271.3401112},
isbn = {9781450380164},
year = {2020},
date = {2020-01-01},
booktitle = {Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2020)},
note = {Xi’an, China (Online). July 25-30, 2020. Conference Rank: GGS A++, Core A*},
pages = {439--448},
publisher = {Association for Computing Machinery},
address = {Virtual Event, China},
series = {SIGIR '20},
abstract = {Truthfulness judgments are a fundamental step in the process of fighting misinformation, as they are crucial to train and evaluate classifiers that automatically distinguish true and false statements. Usually such judgments are made by experts, like journalists for political statements or medical doctors for medical statements. In this paper, we follow a different approach and rely on (non-expert) crowd workers. This of course leads to the following research question: Can crowdsourcing be reliably used to assess the truthfulness of information and to create large-scale labeled collections for information credibility systems? To address this issue, we present the results of an extensive study based on crowdsourcing: we collect thousands of truthfulness assessments over two datasets, and we compare expert judgments with crowd judgments, expressed on scales with various granularity levels. We also measure the political bias and the cognitive background of the workers, and quantify their effect on the reliability of the data provided by the crowd.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Barbera, David La; Roitero, Kevin; Demartini, Gianluca; Mizzaro, Stefano; Spina, Damiano
Crowdsourcing Truthfulness: The Impact of Judgment Scale and Assessor Bias Proceedings Article
In: Jose, Joemon M.; Yilmaz, Emine; Magalhães, João; Castells, Pablo; Ferro, Nicola; Silva, Mário J.; Martins, Flávio (Ed.): Advances in Information Retrieval - 42nd European Conference on IR Research, ECIR 2020, Lisbon, Portugal, April 14-17, 2020, Proceedings, Part II, pp. 207–214, Springer, 2020.
@inproceedings{DBLP:conf/ecir/BarberaRDMS20,
  title     = {Crowdsourcing Truthfulness: The Impact of Judgment Scale and Assessor Bias},
  author    = {David La Barbera and Kevin Roitero and Gianluca Demartini and Stefano Mizzaro and Damiano Spina},
  editor    = {Joemon M. Jose and Emine Yilmaz and João Magalhães and Pablo Castells and Nicola Ferro and Mário J. Silva and Flávio Martins},
  booktitle = {Advances in Information Retrieval - 42nd European Conference on IR Research, ECIR 2020, Lisbon, Portugal, April 14-17, 2020, Proceedings, Part II},
  series    = {Lecture Notes in Computer Science},
  volume    = {12036},
  pages     = {207--214},
  publisher = {Springer},
  url       = {https://doi.org/10.1007/978-3-030-45442-5_26},
  doi       = {10.1007/978-3-030-45442-5_26},
  year      = {2020},
  date      = {2020-01-01},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {inproceedings}
}
Demartini, Gianluca; Mizzaro, Stefano; Spina, Damiano
Human-in-the-loop Artificial Intelligence for Fighting Online Misinformation: Challenges and Opportunities Journal Article
In: IEEE Data Eng. Bull., vol. 43, no 3, pp. 65–74, 2020.
@article{DBLP:journals/debu/DemartiniMS20,
  title     = {Human-in-the-loop Artificial Intelligence for Fighting Online Misinformation: Challenges and Opportunities},
  author    = {Gianluca Demartini and Stefano Mizzaro and Damiano Spina},
  journal   = {IEEE Data Eng. Bull.},
  volume    = {43},
  number    = {3},
  pages     = {65--74},
  url       = {http://sites.computer.org/debull/A20sept/p65.pdf},
  year      = {2020},
  date      = {2020-01-01},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {article}
}
Barbera, David La; Polónia, António; Roitero, Kevin; Conde-Sousa, Eduardo; Mea, Vincenzo Della
Detection of HER2 from Haematoxylin-Eosin Slides Through a Cascade of Deep Learning Classifiers via Multi-Instance Learning Journal Article
In: Journal of Imaging, vol. 6, no 9, 2020, ISSN: 2313-433X.
@article{jimaging6090082,
title = {Detection of {HER2} from {Haematoxylin-Eosin} Slides Through a Cascade of Deep Learning Classifiers via Multi-Instance Learning},
author = {David La Barbera and António Polónia and Kevin Roitero and Eduardo Conde-Sousa and Vincenzo Della Mea},
url = {https://www.mdpi.com/2313-433X/6/9/82},
doi = {10.3390/jimaging6090082},
issn = {2313-433X},
year = {2020},
date = {2020-01-01},
journal = {Journal of Imaging},
volume = {6},
number = {9},
pages = {82},
abstract = {Breast cancer is the most frequently diagnosed cancer in woman. The correct identification of the HER2 receptor is a matter of major importance when dealing with breast cancer: an over-expression of HER2 is associated with aggressive clinical behaviour; moreover, HER2 targeted therapy results in a significant improvement in the overall survival rate. In this work, we employ a pipeline based on a cascade of deep neural network classifiers and multi-instance learning to detect the presence of HER2 from Haematoxylin-Eosin slides, which partly mimics the pathologist’s behaviour by first recognizing cancer and then evaluating HER2. Our results show that the proposed system presents a good overall effectiveness. Furthermore, the system design is prone to further improvements that can be easily deployed in order to increase the effectiveness score.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Roitero, Kevin; Carterette, Ben; Mehrotra, Rishabh; Lalmas, Mounia
Leveraging Behavioral Heterogeneity Across Markets for Cross-Market Training of Recommender Systems Book Chapter
In: Companion Proceedings of the Web Conference 2020, pp. 694–702, Association for Computing Machinery, New York, NY, USA, 2020, ISBN: 9781450370240.
@inbook{10.1145/3366424.3384362,
title = {Leveraging Behavioral Heterogeneity Across Markets for Cross-Market Training of Recommender Systems},
author = {Kevin Roitero and Ben Carterette and Rishabh Mehrotra and Mounia Lalmas},
url = {https://doi.org/10.1145/3366424.3384362},
doi = {10.1145/3366424.3384362},
isbn = {9781450370240},
year = {2020},
date = {2020-01-01},
booktitle = {Companion Proceedings of the Web Conference 2020},
pages = {694--702},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
abstract = {Modern recommender systems are optimised to deliver personalised recommendations to millions of users spread across different geographic regions exhibiting various forms of heterogeneity, including behavioural-, content- and trend specific heterogeneity. System designers often face the challenge of deploying either a single global model across all markets, or developing custom models for different markets. In this work, we focus on the specific case of music recommendation across 21 different markets, and consider the trade-off between developing global model versus market specific models. We begin by investigating behavioural differences across users of different markets, and motivate the need for considering market as an important factor when training models. We propose five different training styles, covering the entire spectrum of models: from a single global model to individual market specific models, and in the process, propose ways to identify and leverage users abroad, and data from similar markets. Based on a large scale experimentation with data for 100M users across 21 different markets, we present insights which highlight that markets play a key role, and describe models that leverage market specific data in serving personalised recommendations.},
keywords = {},
pubstate = {published},
tppubtype = {inbook}
}
Roitero, Kevin; Bozzato, Cristian; Mea, Vincenzo Della; Mizzaro, Stefano; Serra, Giuseppe
Twitter goes to the Doctor: Detecting Medical Tweets using Machine Learning and BERT. Proceedings Article
In: SIIRH@ ECIR, 2020.
@inproceedings{roitero2020twitter,
title = {Twitter goes to the Doctor: Detecting Medical Tweets using Machine Learning and {BERT}},
author = {Kevin Roitero and Cristian Bozzato and Vincenzo Della Mea and Stefano Mizzaro and Giuseppe Serra},
year = {2020},
date = {2020-01-01},
booktitle = {SIIRH@ECIR},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Amigó, Enrique; Gonzalo, Julio; Mizzaro, Stefano; Carrillo-de-Albornoz, Jorge
An Effectiveness Metric for Ordinal Classification: Formal Properties and Experimental Results Proceedings Article
In: Jurafsky, Dan; Chai, Joyce; Schluter, Natalie; Tetreault, Joel (Ed.): Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 3938–3949, Association for Computational Linguistics, Online, 2020.
@inproceedings{amigo-etal-2020-effectiveness,
title = {An Effectiveness Metric for Ordinal Classification: Formal Properties and Experimental Results},
author = {Enrique Amigó and Julio Gonzalo and Stefano Mizzaro and Jorge Carrillo-de-Albornoz},
editor = {Dan Jurafsky and Joyce Chai and Natalie Schluter and Joel Tetreault},
url = {https://aclanthology.org/2020.acl-main.363},
doi = {10.18653/v1/2020.acl-main.363},
year = {2020},
date = {2020-01-01},
booktitle = {Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics},
pages = {3938--3949},
publisher = {Association for Computational Linguistics},
address = {Online},
abstract = {In Ordinal Classification tasks, items have to be assigned to classes that have a relative ordering, such as ``positive'', ``neutral'', ``negative'' in sentiment analysis. Remarkably, the most popular evaluation metrics for ordinal classification tasks either ignore relevant information (for instance, precision/recall on each of the classes ignores their relative ordering) or assume additional information (for instance, Mean Average Error assumes absolute distances between classes). In this paper we propose a new metric for Ordinal Classification, Closeness Evaluation Measure, that is rooted on Measurement Theory and Information Theory. Our theoretical analysis and experimental results over both synthetic data and data from NLP shared tasks indicate that the proposed metric captures quality aspects from different traditional tasks simultaneously. In addition, it generalizes some popular classification (nominal scale) and error minimization (interval scale) metrics, depending on the measurement scale in which it is instantiated.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Amigó, Enrique; Mizzaro, Stefano
On the nature of information access evaluation metrics: a unifying framework Journal Article
In: Information Retrieval Journal, vol. 23, no 3, pp. 318–386, 2020, ISSN: 1573-7659.
@article{Amigo2020,
title = {On the nature of information access evaluation metrics: a unifying framework},
author = {Enrique Amigó and Stefano Mizzaro},
url = {https://doi.org/10.1007/s10791-020-09374-0},
doi = {10.1007/s10791-020-09374-0},
issn = {1573-7659},
year = {2020},
date = {2020-01-01},
journal = {Information Retrieval Journal},
volume = {23},
number = {3},
pages = {318--386},
abstract = {We provide a uniform, general, and complete formal account of evaluation metrics for ranking, classification, clustering, and other information access problems. We leverage concepts from measurement theory, such as scale types and permissible transformation functions, and we capture the nature of evaluation metrics in many tasks by two formal definitions, which lead to a distinction of two metric/tasks families, and provide a comprehensive classification of the tasks that have been proposed so far. We derive some theorems to analyze the suitability (or otherwise) of some common metrics. Within our model we can derive and explain the theoretical properties and drawbacks of the state of the art metrics for multiple tasks. The main contributions of this paper are that, differently from previous studies, the formalization is well grounded on a solid discipline, it is general as it can take into account most effectiveness metrics as well as most existing tasks, and it allows to derive important consequences on metrics and their limitations.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Amigó, Enrique; Fang, Hui; Mizzaro, Stefano; Zhai, Chengxiang
Axiomatic thinking for information retrieval: introduction to special issue Journal Article
In: Information Retrieval Journal, vol. 23, no 3, pp. 187–190, 2020, ISSN: 1573-7659.
@article{Amigo2020b,
title = {Axiomatic thinking for information retrieval: introduction to special issue},
author = {Enrique Amigó and Hui Fang and Stefano Mizzaro and Chengxiang Zhai},
url = {https://doi.org/10.1007/s10791-020-09376-y},
doi = {10.1007/s10791-020-09376-y},
issn = {1573-7659},
year = {2020},
date = {2020-01-01},
journal = {Information Retrieval Journal},
volume = {23},
number = {3},
pages = {187--190},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
2019
Roitero, Kevin; Soprano, Michael; Mizzaro, Stefano
Bias and Fairness in Effectiveness Evaluation by Means of Network Analysis and Mixture Models Proceedings Article
In: CEUR Workshop Proceedings, pp. 2, CEUR-WS, 2019.
@inproceedings{bias-fairness-19,
title = {Bias and Fairness in Effectiveness Evaluation by Means of Network Analysis and Mixture Models},
author = {Kevin Roitero and Michael Soprano and Stefano Mizzaro},
year = {2019},
date = {2019-10-14},
booktitle = {CEUR Workshop Proceedings},
volume = {2441},
pages = {2},
publisher = {CEUR-WS},
internal-note = {Review: apparent duplicate of entry DBLP:conf/iir/RoiteroMS19 (same title and authors, same CEUR Vol-2441); consider merging},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Han, L; Roitero, K; Gadiraju, U; Sarasua, C; Checco, A; Maddalena, E; Demartini, G
The Impact of Task Abandonment in Crowdsourcing Journal Article
In: IEEE Transactions on Knowledge & Data Engineering, no 01, pp. 1-1, 2019, ISSN: 1558-2191.
@article{8873609,
title = {The Impact of Task Abandonment in Crowdsourcing},
author = {Han, L. and Roitero, K. and Gadiraju, U. and Sarasua, C. and Checco, A. and Maddalena, E. and Demartini, G.},
doi = {10.1109/TKDE.2019.2948168},
issn = {1558-2191},
year = {2019},
date = {2019-10-01},
journal = {IEEE Transactions on Knowledge \& Data Engineering},
number = {01},
pages = {1--1},
publisher = {IEEE Computer Society},
address = {Los Alamitos, CA, USA},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Roitero, Kevin; Culpepper, J. Shane; Sanderson, Mark; Scholer, Falk; Mizzaro, Stefano
Fewer topics? A million topics? Both?! On topics subsets in test collections Journal Article
In: Information Retrieval Journal, 2019, ISSN: 1573-7659.
@article{Roitero2019ffew,
title = {Fewer topics? A million topics? Both?! On topics subsets in test collections},
author = {Roitero, Kevin and Culpepper, J. Shane and Sanderson, Mark and Scholer, Falk and Mizzaro, Stefano},
url = {https://doi.org/10.1007/s10791-019-09357-w},
doi = {10.1007/s10791-019-09357-w},
issn = {1573-7659},
year = {2019},
date = {2019-05-08},
journal = {Information Retrieval Journal},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
}
Roitero, Kevin; Mizzaro, Stefano; Soprano, Michael
Bias and Fairness in Effectiveness Evaluation by Means of Network Analysis and Mixture Models Proceedings Article
In: Proceedings of the 10th Italian Information Retrieval Workshop, Padova, Italy, September 16-18, 2019., pp. 6–7, 2019.
@inproceedings{DBLP:conf/iir/RoiteroMS19,
  title     = {Bias and Fairness in Effectiveness Evaluation by Means of Network Analysis and Mixture Models},
  author    = {Kevin Roitero and Stefano Mizzaro and Michael Soprano},
  booktitle = {Proceedings of the 10th Italian Information Retrieval Workshop, Padova, Italy, September 16-18, 2019.},
  pages     = {6--7},
  url       = {http://ceur-ws.org/Vol-2441/paper4.pdf},
  year      = {2019},
  date      = {2019-01-01},
  keywords  = {},
  pubstate  = {published},
  tppubtype = {inproceedings}
}
}