A (non-exhaustive) list of publications produced by members of the laboratory
2025
Soprano, Michael; Modha, Sandip; Roitero, Kevin; Maddalena, Eddy; Viviani, Marco; Pasi, Gabriella; Mizzaro, Stefano
AIDME: A Scalable, Interpretable Framework for AI-Aided Scoping Reviews Proceedings Article
In: Proceedings of the 2025 International ACM SIGIR Conference on Innovative Concepts and Theories in Information Retrieval (ICTIR), pp. 194–207, Association for Computing Machinery, Padua, Italy, 2025, ISBN: 9798400718618.
@inproceedings{10.1145/3731120.3744586,
title = {AIDME: A Scalable, Interpretable Framework for AI-Aided Scoping Reviews},
author = {Michael Soprano and Sandip Modha and Kevin Roitero and Eddy Maddalena and Marco Viviani and Gabriella Pasi and Stefano Mizzaro},
url = {https://doi.org/10.1145/3731120.3744586},
doi = {10.1145/3731120.3744586},
isbn = {9798400718618},
year = {2025},
date = {2025-07-18},
urldate = {2025-01-01},
booktitle = {Proceedings of the 2025 International ACM SIGIR Conference on Innovative Concepts and Theories in Information Retrieval (ICTIR)},
pages = {194–207},
publisher = {Association for Computing Machinery},
address = {Padua, Italy},
series = {ICTIR '25},
abstract = {Scientific publishing is expanding rapidly across disciplines, making it increasingly difficult for researchers to organize, filter, and synthesize the literature. Systematic reviews address this challenge through structured analysis, but the early stages, particularly the screening phase, can become overwhelming when faced with thousands of records. Scoping reviews are often used as a preparatory step to explore and structure the literature before applying stricter protocols such as the PRISMA 2020 guidelines. In this work, we introduce AIDME (AI-Aided Document Mapping and Evaluation), a general-purpose framework that leverages Large Language Models (LLMs), topic modeling, thematic labeling, and citation network analysis to support the creation of scoping reviews in research areas with high publication volume. AIDME enables scalable filtering, clustering, labeling, and prioritization of publications while preserving human oversight. We evaluate the proposed framework through a case study on methods for assessing truthfulness in fact-checking, a fast-evolving field characterized by inconsistent terminology and fragmented methodologies. Our results show that AIDME reduces manual effort and produces structured outputs that facilitate subsequent PRISMA-compliant systematic reviews.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
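The pipeline stage this framework automates can be pictured with a small sketch: clustering record abstracts so they can be thematically labeled and prioritized. The choice of scikit-learn TF-IDF plus k-means below is our assumption for illustration; the paper itself combines LLMs, topic modeling, thematic labeling, and citation network analysis rather than this exact setup.

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.cluster import KMeans

# Hypothetical abstracts standing in for records retrieved in a scoping review.
abstracts = [
    "crowdsourced truthfulness assessment of political claims",
    "large language models for automated fact-checking",
    "citation network analysis of information retrieval papers",
    "graph-based exploration of citation networks",
]
X = TfidfVectorizer().fit_transform(abstracts)  # sparse document-term matrix
labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)
print(labels)  # one cluster id per record, ready for thematic labeling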
Demartini, Gianluca; Hauff, Claudia; Lease, Matthew; Mizzaro, Stefano; Roitero, Kevin; Sanderson, Mark; Scholer, Falk; Shah, Chirag; Spina, Damiano; Thomas, Paul; Vries, Arjen P. de; Zuccon, Guido
Preaching to the ChoIR: Lessons IR Should Share with AI Proceedings Article
In: Proceedings of the 2025 International ACM SIGIR Conference on Innovative Concepts and Theories in Information Retrieval (ICTIR), pp. 78–91, Association for Computing Machinery, Padua, Italy, 2025, ISBN: 9798400718618.
@inproceedings{10.1145/3731120.3744612,
title = {Preaching to the ChoIR: Lessons IR Should Share with AI},
author = {Gianluca Demartini and Claudia Hauff and Matthew Lease and Stefano Mizzaro and Kevin Roitero and Mark Sanderson and Falk Scholer and Chirag Shah and Damiano Spina and Paul Thomas and Arjen P. de Vries and Guido Zuccon},
url = {https://doi.org/10.1145/3731120.3744612},
doi = {10.1145/3731120.3744612},
isbn = {9798400718618},
year = {2025},
date = {2025-07-18},
urldate = {2025-01-01},
booktitle = {Proceedings of the 2025 International ACM SIGIR Conference on Innovative Concepts and Theories in Information Retrieval (ICTIR)},
pages = {78–91},
publisher = {Association for Computing Machinery},
address = {Padua, Italy},
series = {ICTIR '25},
abstract = {The field of Information Retrieval (IR) changed profoundly at the end of the 1990s with the rise of Web Search, and there are parallels with developments in Artificial Intelligence (AI) happening today with the advent of ChatGPT, Large Language Models, and Generative AI. We acknowledge that there are clear differences between IR and AI. For example, IR is a much smaller field, and new problems arise, like data contamination that may affect benchmark-based evaluation of AI systems. But looking through the lens of an IR researcher, there are many striking similarities between the two fields of IR (25 years ago) and AI (today), and many topics appearing in discussions in AI resemble those of 25 years ago in IR: benchmark reliability and robust evaluation, reproducibility of results for non-public models, privacy and copyright issues, efficiency and scalability, etc. In this paper, we discuss similarities and differences between IR and AI and then derive some lessons learned in the field of IR as a list of recommendations - urging the IR community to reflect on, discuss, and convey these lessons to the AI field. We believe that a joint community effort by all IR researchers is both necessary and dutiful to obtain a fruitful discussion and research advancements with the AI community.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Barbera, David La; Lunardi, Riccardo; Zhuang, Mengdie; Roitero, Kevin
Impersonating the Crowd: Evaluating LLMs' Ability to Replicate Human Judgment in Misinformation Assessment Proceedings Article
In: Proceedings of the 2025 International ACM SIGIR Conference on Innovative Concepts and Theories in Information Retrieval (ICTIR), pp. 12–21, Association for Computing Machinery, Padua, Italy, 2025, ISBN: 9798400718618.
@inproceedings{10.1145/3731120.3744581,
title = {Impersonating the Crowd: Evaluating LLMs' Ability to Replicate Human Judgment in Misinformation Assessment},
author = {David La Barbera and Riccardo Lunardi and Mengdie Zhuang and Kevin Roitero},
url = {https://doi.org/10.1145/3731120.3744581},
doi = {10.1145/3731120.3744581},
isbn = {9798400718618},
year = {2025},
date = {2025-07-18},
urldate = {2025-01-01},
booktitle = {Proceedings of the 2025 International ACM SIGIR Conference on Innovative Concepts and Theories in Information Retrieval (ICTIR)},
pages = {12–21},
publisher = {Association for Computing Machinery},
address = {Padua, Italy},
series = {ICTIR '25},
abstract = {Large Language Models (LLMs) are increasingly used to replicate human decision-making in subjective tasks. In this work, we investigate whether LLMs can effectively impersonate real crowd workers when evaluating political misinformation statements. We assess (i) the agreement between LLM-generated assessments and human judgments and (ii) whether impersonation skews LLM assessments, impacting accuracy. Using publicly available misinformation assessment datasets, we prompt LLMs to impersonate real crowd workers based on their demographic profiles and evaluate them under the same statements. Through comparative analysis, we measure agreement rates and discrepancies in classification patterns. Our findings suggest that while some LLMs align moderately with crowd assessments, their impersonation ability remains inconsistent. Impersonation does not uniformly improve accuracy and often reinforces systematic biases, highlighting limitations in replicating human judgment.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
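The impersonation mechanism the abstract describes can be sketched as a prompt builder that prefixes a worker's demographic profile to the judgment request; the profile fields and wording below are hypothetical, not the paper's exact template.

# Minimal sketch: build an impersonation prompt from a (hypothetical) worker profile.
def impersonation_prompt(profile, statement):
    return (
        f"You are a {profile['age']}-year-old {profile['gender']} from "
        f"{profile['country']} with {profile['education']} education and "
        f"{profile['political_leaning']} political views.\n"
        f"Rate the truthfulness of the following statement on a 0-5 scale:\n"
        f'"{statement}"'
    )

print(impersonation_prompt(
    {"age": 34, "gender": "woman", "country": "the US",
     "education": "college", "political_leaning": "moderate"},
    "The unemployment rate doubled last year."))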
Soprano, Michael; Tapu, Denis Eduard; Barbera, David La; Roitero, Kevin; Mizzaro, Stefano
The Magnitude of Truth: On Using Magnitude Estimation for Truthfulness Assessment Proceedings Article
In: Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 446–456, Association for Computing Machinery, Padua, Italy, 2025, ISBN: 9798400715921.
@inproceedings{10.1145/3726302.3730091,
title = {The Magnitude of Truth: On Using Magnitude Estimation for Truthfulness Assessment},
author = {Michael Soprano and Denis Eduard Tapu and David La Barbera and Kevin Roitero and Stefano Mizzaro},
url = {https://doi.org/10.1145/3726302.3730091},
doi = {10.1145/3726302.3730091},
isbn = {9798400715921},
year = {2025},
date = {2025-07-13},
urldate = {2025-01-01},
booktitle = {Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval},
pages = {446–456},
publisher = {Association for Computing Machinery},
address = {Padua, Italy},
series = {SIGIR '25},
abstract = {Assessing the truthfulness of information is a critical task in fact-checking, and is typically performed using binary or coarse ordinal scales (2-6 levels), though fine-grained scales (e.g., 100 levels) have also been explored. Magnitude Estimation (ME) takes this approach further by allowing assessors to assign any value in the range (0, +∞). However, it introduces challenges, including the need for aggregation of assessments from individuals with different interpretations of the scale. Despite these challenges, its successful applications in other domains suggest its potential suitability for truthfulness assessment. We conduct a crowdsourcing study by collecting assessments on claims sourced from the PolitiFact fact-checking organization using ME. To the best of our knowledge, this is the first systematic investigation of ME in the context of truthfulness assessment. Our results show that while aggregation methods significantly impact assessment quality, optimal aggregation strategies yield accuracy and reliability comparable to traditional scales. More importantly, ME allows capturing subtle differences in truthfulness, offering richer insights than conventional coarse-grained scales.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
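To make the aggregation challenge concrete, here is a minimal sketch of one standard Magnitude Estimation normalization, log-transforming scores and centering them per worker before averaging per claim; this is an illustrative assumption, not necessarily the paper's best-performing strategy.

import math
from collections import defaultdict

def aggregate_me(assessments):
    """assessments: list of (worker_id, claim_id, score) with score > 0."""
    # Each worker uses the open-ended scale differently, so first compute
    # a per-worker mean of log-scores to remove individual scale effects.
    by_worker = defaultdict(list)
    for worker, _, score in assessments:
        by_worker[worker].append(math.log(score))
    worker_mean = {w: sum(v) / len(v) for w, v in by_worker.items()}

    # Center each judgment by its worker's mean, then average per claim.
    by_claim = defaultdict(list)
    for worker, claim, score in assessments:
        by_claim[claim].append(math.log(score) - worker_mean[worker])
    return {c: sum(v) / len(v) for c, v in by_claim.items()}

print(aggregate_me([("w1", "claim-1", 10), ("w1", "claim-2", 100),
                    ("w2", "claim-1", 2), ("w2", "claim-2", 8)]))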
Roitero, Kevin; Wright, Dustin; Soprano, Michael; Augenstein, Isabelle; Mizzaro, Stefano
Efficiency and Effectiveness of LLM-Based Summarization of Evidence in Crowdsourced Fact-Checking Proceedings Article
In: Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 457–467, Association for Computing Machinery, Padua, Italy, 2025, ISBN: 9798400715921.
@inproceedings{10.1145/3726302.3729960,
title = {Efficiency and Effectiveness of LLM-Based Summarization of Evidence in Crowdsourced Fact-Checking},
author = {Kevin Roitero and Dustin Wright and Michael Soprano and Isabelle Augenstein and Stefano Mizzaro},
url = {https://doi.org/10.1145/3726302.3729960},
doi = {10.1145/3726302.3729960},
isbn = {9798400715921},
year = {2025},
date = {2025-07-13},
urldate = {2025-01-01},
booktitle = {Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval},
pages = {457–467},
publisher = {Association for Computing Machinery},
address = {Padua, Italy},
series = {SIGIR '25},
abstract = {Evaluating the truthfulness of online content is critical for combating misinformation. This study examines the efficiency and effectiveness of crowdsourced truthfulness assessments through a comparative analysis of two approaches: one involving full-length webpages as evidence for each claim, and another using summaries for each evidence document generated with an LLM. Using an A/B testing setting, we engage a diverse pool of participants tasked with evaluating the truthfulness of statements under these conditions. Our analysis explores both the quality of assessments and the behavioral patterns of participants. The results reveal that relying on summarized evidence offers comparable accuracy and error metrics to the standard modality while significantly improving efficiency. Workers in the Summary setting complete a significantly higher number of assessments, reducing task duration and costs. Additionally, the Summary modality maximizes internal agreement and maintains consistent reliance on and perceived usefulness of evidence, demonstrating its potential to streamline large-scale truthfulness evaluations.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Lunardi, Riccardo; Soprano, Michael; Coppola, Paolo; Mea, Vincenzo Della; Mizzaro, Stefano; Roitero, Kevin
PILs of Knowledge: A Synthetic Benchmark for Evaluating Question Answering Systems in Healthcare Proceedings Article
In: Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 3648–3658, Association for Computing Machinery, Padua, Italy, 2025, ISBN: 9798400715921.
@inproceedings{10.1145/3726302.3730283,
title = {PILs of Knowledge: A Synthetic Benchmark for Evaluating Question Answering Systems in Healthcare},
author = {Riccardo Lunardi and Michael Soprano and Paolo Coppola and Vincenzo Della Mea and Stefano Mizzaro and Kevin Roitero},
url = {https://doi.org/10.1145/3726302.3730283},
doi = {10.1145/3726302.3730283},
isbn = {9798400715921},
year = {2025},
date = {2025-07-13},
urldate = {2025-01-01},
booktitle = {Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval},
pages = {3648–3658},
publisher = {Association for Computing Machinery},
address = {Padua, Italy},
series = {SIGIR '25},
abstract = {Patient Information Leaflets (PILs) provide essential information about medication usage, side effects, precautions, and interactions, making them a valuable resource for Question Answering (QA) systems in healthcare. However, no dedicated benchmark currently exists to evaluate QA systems specifically on PILs, limiting progress in this domain. To address this gap, we introduce a fact-supported synthetic benchmark composed of multiple-choice questions and answers generated from real PILs. We construct the benchmark using a fully automated pipeline that leverages multiple Large Language Models (LLMs) to generate diverse, realistic, and contextually relevant question-answer pairs. The benchmark is publicly released as a standardized evaluation framework for assessing the ability of LLMs to process and reason over PIL content. To validate its effectiveness, we conduct an initial evaluation with state-of-the-art LLMs, showing that the benchmark presents a realistic and challenging task, making it a valuable resource for advancing QA research in the healthcare domain.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
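The generation step the abstract describes can be sketched as a prompt that turns a leaflet passage into a fact-supported multiple-choice question; the template and output schema below are hypothetical.

# Minimal sketch: prompt an LLM to generate one MCQ from a PIL passage.
def mcq_prompt(leaflet_passage):
    return (
        "From the following patient information leaflet passage, write one "
        "multiple-choice question with four options and mark the correct one. "
        "Return JSON with keys 'question', 'options', 'answer', and "
        "'supporting_fact'.\n\n" + leaflet_passage
    )

print(mcq_prompt("Do not take more than 6 tablets in any 24-hour period."))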
Soprano, Michael; Maddalena, Eddy; Ros, Francesca Da; Zuliani, Maria Elena; Mizzaro, Stefano
Evaluation of Crowdsourced Peer Review using Synthetic Data and Simulations Proceedings Article
In: Cornia, Marcella; Nunzio, Giorgio Maria Di; Firmani, Donatella; Mizzaro, Stefano; Serra, Giuseppe; Tonelli, Sara; Tremamunno, Alessandro (Ed.): Proceedings of the 21st Conference on Information and Research Science Connecting to Digital and Library Science, CEUR-WS.org, Udine, Italy, 2025, ISSN: 1613-0073.
@inproceedings{soprano2025evaluation,
title = {Evaluation of Crowdsourced Peer Review using Synthetic Data and Simulations},
author = {Michael Soprano and Eddy Maddalena and Francesca Da Ros and Maria Elena Zuliani and Stefano Mizzaro},
editor = {Marcella Cornia and Giorgio Maria Di Nunzio and Donatella Firmani and Stefano Mizzaro and Giuseppe Serra and Sara Tonelli and Alessandro Tremamunno},
url = {https://ceur-ws.org/Vol-3937/paper8.pdf},
issn = {1613-0073},
year = {2025},
date = {2025-03-09},
urldate = {2025-01-01},
booktitle = {Proceedings of the 21st Conference on Information and Research Science Connecting to Digital and Library Science},
volume = {3937},
publisher = {CEUR-WS.org},
address = {Udine, Italy},
series = {CEUR Workshop Proceedings},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Maddalena, Eddy; Mizzaro, Stefano; Roitero, Kevin; Viviani, Marco; Barbera, David La; Modha, Sandip; Pasi, Gabriella; Ros, Francesca Da; Soprano, Michael
Report on the 14th Italian Information Retrieval Workshop (IIR 2024) Journal Article
In: SIGIR Forum, vol. 58, no 2, pp. 1–13, 2025, ISSN: 0163-5840.
@article{maddalena2024iir,
title = {Report on the 14th Italian Information Retrieval Workshop (IIR 2024)},
author = {Eddy Maddalena and Stefano Mizzaro and Kevin Roitero and Marco Viviani and David La Barbera and Sandip Modha and Gabriella Pasi and Francesca Da Ros and Michael Soprano},
url = {https://sigir.org/wp-content/uploads/2025/01/p15.pdf},
issn = {0163-5840},
year = {2025},
date = {2025-01-16},
urldate = {2024-01-01},
journal = {SIGIR Forum},
volume = {58},
number = {2},
pages = {1–13},
publisher = {Association for Computing Machinery (ACM)},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Spina, Damiano; Roitero, Kevin; Mizzaro, Stefano; Mea, Vincenzo Della; Ros, Francesca Da; Soprano, Michael; Akebli, Hafsa; Falcon, Alex; Fasihi, Mehdi; Fiorin, Alessio; Barbera, David La; Bosco, Daniele Lizzio; Lunardi, Riccardo; Marturano, Alberto; Muhammad, Zaka-Ud-Din; Nascimben, Francesco; Nottebaum, Moritz; Pascoli, Massimiliano; Popescu, Mihai Horia; Rasotto, Laura; Rehman, Mubashara; Taverna, Francesco; Tomasetig, Biagio; Tremamunno, Alessandro
Report on the Hands-On PhD Course on Responsible AI from the Lens of an Information Access Researcher Journal Article
In: SIGIR Forum, vol. 58, no 2, pp. 1–61, 2025, ISSN: 0163-5840.
@article{spina2024responsibleai,
title = {Report on the Hands-On PhD Course on Responsible AI from the Lens of an Information Access Researcher},
author = {Damiano Spina and Kevin Roitero and Stefano Mizzaro and Vincenzo Della Mea and Francesca Da Ros and Michael Soprano and Hafsa Akebli and Alex Falcon and Mehdi Fasihi and Alessio Fiorin and David La Barbera and Daniele Lizzio Bosco and Riccardo Lunardi and Alberto Marturano and Zaka-Ud-Din Muhammad and Francesco Nascimben and Moritz Nottebaum and Massimiliano Pascoli and Mihai Horia Popescu and Laura Rasotto and Mubashara Rehman and Francesco Taverna and Biagio Tomasetig and Alessandro Tremamunno},
url = {https://sigir.org/wp-content/uploads/2025/01/p07.pdf},
issn = {0163-5840},
year = {2025},
date = {2025-01-14},
urldate = {2024-01-16},
journal = {SIGIR Forum},
volume = {58},
number = {2},
pages = {1–61},
publisher = {Association for Computing Machinery (ACM)},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
2024
Lunardi, Riccardo; Barbera, David La; Roitero, Kevin
The Elusiveness of Detecting Political Bias in Language Models Proceedings Article
In: Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, pp. 3922–3926, Association for Computing Machinery, Boise, ID, USA, 2024, ISBN: 9798400704369.
@inproceedings{10.1145/3627673.3680002,
title = {The Elusiveness of Detecting Political Bias in Language Models},
author = {Riccardo Lunardi and David La Barbera and Kevin Roitero},
url = {https://doi.org/10.1145/3627673.3680002},
doi = {10.1145/3627673.3680002},
isbn = {9798400704369},
year = {2024},
date = {2024-10-21},
urldate = {2024-01-01},
booktitle = {Proceedings of the 33rd ACM International Conference on Information and Knowledge Management},
pages = {3922–3926},
publisher = {Association for Computing Machinery},
address = {Boise, ID, USA},
series = {CIKM '24},
abstract = {This study challenges the prevailing approach of measuring political leanings in Large Language Models (LLMs) through direct questioning. By extensively testing LLMs with original, positively and negatively paraphrased Political Compass questions we demonstrate that LLMs do not consistently reveal their political biases in response to standard questions. Our findings indicate that LLMs' political orientations are elusive, easily influenced by subtle changes in phrasing and context. This study underscores the limitations of direct questioning in accurately measuring the political biases of LLMs and emphasizes the necessity for more refined and effective approaches to understand their true political stances.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
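The testing protocol is straightforward to sketch: pose the same Political Compass item in original, positively paraphrased, and negatively paraphrased form, then compare the model's answers; the wordings below are illustrative, not the paper's actual items.

item = "The rich are too highly taxed."
variants = {
    "original": item,
    "positive paraphrase": "High earners already contribute more than their fair share in taxes.",
    "negative paraphrase": "It is not true that the rich are taxed too heavily.",
}
for name, text in variants.items():
    prompt = f'Do you agree or disagree with the following statement? "{text}"'
    # answer = query_llm(prompt)  # a stable political stance should answer all three coherently
    print(name, "->", prompt)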
Roitero, Kevin; Soprano, Michael; Barbera, David La; Maddalena, Eddy; Mizzaro, Stefano
Enhancing Fact-Checking: From Crowdsourced Validation to Integration with Large Language Models Proceedings Article
In: Mizzaro, Stefano; Maddalena, Eddy; Viviani, Marco; Roitero, Kevin (Ed.): Proceedings of the 14th Italian Information Retrieval Workshop, pp. 74–77, CEUR-WS.org, Udine, Italy, 2024.
@inproceedings{DBLP:conf/iir/Roitero24,
title = {Enhancing Fact-Checking: From Crowdsourced Validation to Integration with Large Language Models},
author = {Kevin Roitero and Michael Soprano and David La Barbera and Eddy Maddalena and Stefano Mizzaro},
editor = {Stefano Mizzaro and Eddy Maddalena and Marco Viviani and Kevin Roitero},
url = {https://ceur-ws.org/Vol-3802/paper13.pdf},
year = {2024},
date = {2024-10-16},
urldate = {2024-01-01},
booktitle = {Proceedings of the 14th Italian Information Retrieval Workshop},
volume = {3802},
pages = {74–77},
publisher = {CEUR-WS.org},
address = {Udine, Italy},
series = {CEUR Workshop Proceedings},
abstract = {Information retrieval effectiveness evaluation is often carried out by means of test collections. Many works investigated possible sources of bias in such an approach. We propose a systematic approach to identify bias and its causes, and to remove it, thus enforcing fairness in effectiveness evaluation by means of test collections.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Singh, Jaspreet; Soprano, Michael; Roitero, Kevin; Ceolin, Davide
Crowdsourcing Statement Classification to Enhance Information Quality Prediction Proceedings Article
In: Preuss, Mike; Leszkiewicz, Agata; Boucher, Jean-Christopher; Fridman, Ofer; Stampe, Lucas (Ed.): Proceedings of the 6th Multidisciplinary International Symposium on Disinformation in Open Online Media (MISDOOM 2024), pp. 70–85, Springer Nature Switzerland, Münster, Germany, 2024, ISBN: 978-3-031-71210-4.
@inproceedings{10.1007/978-3-031-71210-4_5,
title = {Crowdsourcing Statement Classification to Enhance Information Quality Prediction},
author = {Jaspreet Singh and Michael Soprano and Kevin Roitero and Davide Ceolin},
editor = {Mike Preuss and Agata Leszkiewicz and Jean-Christopher Boucher and Ofer Fridman and Lucas Stampe},
url = {https://link.springer.com/chapter/10.1007/978-3-031-71210-4_5},
doi = {10.1007/978-3-031-71210-4_5},
isbn = {978-3-031-71210-4},
year = {2024},
date = {2024-08-31},
urldate = {2024-01-01},
booktitle = {Proceedings of the 6th Multidisciplinary International Symposium on Disinformation in Open Online Media (MISDOOM 2024)},
pages = {70–85},
publisher = {Springer Nature Switzerland},
address = {Münster, Germany},
series = {Lecture Notes in Computer Science},
abstract = {This paper explores the use of crowdsourcing to classify statement types in film reviews to assess their information quality. Employing the Argument Type Identification Procedure which uses the Periodic Table of Arguments to categorize arguments, the study aims to connect statement types to the overall argument strength and information reliability. Focusing on non-expert annotators in a crowdsourcing environment, the research assesses their reliability based on various factors including language proficiency and annotation experience. Results indicate the importance of careful annotator selection and training to achieve high inter-annotator agreement and highlight challenges in crowdsourcing statement classification for information quality assessment.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
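Because the study hinges on inter-annotator agreement, a short sketch of how such agreement is typically quantified may help; using Cohen's kappa on two hypothetical annotators is our choice for illustration, not necessarily the coefficient used in the paper.

from sklearn.metrics import cohen_kappa_score

# Hypothetical statement-type labels from two crowd annotators.
annotator_a = ["fact", "opinion", "fact", "policy", "opinion", "fact"]
annotator_b = ["fact", "opinion", "policy", "policy", "opinion", "fact"]
print(cohen_kappa_score(annotator_a, annotator_b))  # 1.0 = perfect, 0 = chance level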
Soprano, Michael; Roitero, Kevin; Gadiraju, Ujwal; Maddalena, Eddy; Demartini, Gianluca
Longitudinal Loyalty: Understanding The Barriers To Running Longitudinal Studies On Crowdsourcing Platforms Journal Article
In: ACM Transactions on Social Computing, vol. 1, iss. 1, no 1, pp. 50, 2024, ISSN: 2469-7818.
@article{10.1145/3674884,
title = {Longitudinal Loyalty: Understanding The Barriers To Running Longitudinal Studies On Crowdsourcing Platforms},
author = {Michael Soprano and Kevin Roitero and Ujwal Gadiraju and Eddy Maddalena and Gianluca Demartini},
editor = {ACM},
url = {https://doi.org/10.1145/3674884},
doi = {10.1145/3674884},
issn = {2469-7818},
year = {2024},
date = {2024-08-11},
urldate = {2024-08-11},
journal = {ACM Transactions on Social Computing},
volume = {1},
number = {1},
issue = {1},
pages = {50},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
abstract = {Crowdsourcing tasks have been widely used to collect a large number of human labels at scale. While some of these tasks are deployed by requesters and performed only once by crowd workers, others require the same worker to perform the same task or a variant of it more than once, thus participating in a so-called longitudinal study. Despite the prevalence of longitudinal studies in crowdsourcing, there is a limited understanding of factors that influence worker participation in them across different crowdsourcing marketplaces. We present results from a large-scale survey of 300 workers on 3 different micro-task crowdsourcing platforms: Amazon Mechanical Turk, Prolific and Toloka. The aim is to understand how longitudinal studies are performed using crowdsourcing. We collect answers about 547 experiences and we analyze them both quantitatively and qualitatively. We synthesize 17 take-home messages about longitudinal studies together with 8 recommendations for task requesters and 5 best practices for crowdsourcing platforms to adequately conduct and support such kinds of studies. We release the survey and the data at: https://osf.io/h4du9/.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Zeng, Xia; Barbera, David La; Roitero, Kevin; Zubiaga, Arkaitz; Mizzaro, Stefano
Combining Large Language Models and Crowdsourcing for Hybrid Human-AI Misinformation Detection Proceedings Article
In: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 2332–2336, Association for Computing Machinery, Washington DC, USA, 2024, ISBN: 9798400704314.
@inproceedings{10.1145/3626772.3657965,
title = {Combining Large Language Models and Crowdsourcing for Hybrid Human-AI Misinformation Detection},
author = {Xia Zeng and David La Barbera and Kevin Roitero and Arkaitz Zubiaga and Stefano Mizzaro},
url = {https://doi.org/10.1145/3626772.3657965},
doi = {10.1145/3626772.3657965},
isbn = {9798400704314},
year = {2024},
date = {2024-07-11},
urldate = {2024-07-11},
booktitle = {Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval},
pages = {2332–2336},
publisher = {Association for Computing Machinery},
address = {Washington DC, USA},
series = {SIGIR '24},
abstract = {Research on misinformation detection has primarily focused either on furthering Artificial Intelligence (AI) for automated detection or on studying humans' ability to deliver an effective crowdsourced solution. Each of these directions however shows different benefits. This motivates our work to study hybrid human-AI approaches jointly leveraging the potential of large language models and crowdsourcing, which is understudied to date. We propose novel combination strategies Model First, Worker First, and Meta Vote, which we evaluate along with baseline methods such as mean, median, hard- and soft-voting. Using 120 statements from the PolitiFact dataset, and a combination of state-of-the-art AI models and crowdsourced assessments, we evaluate the effectiveness of these combination strategies. Results suggest that the effectiveness varies with scales granularity, and that combining AI and human judgments enhances truthfulness assessments' effectiveness and robustness.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
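Two of the baseline combination strategies named in the abstract, hard- and soft-voting, can be sketched in a few lines; the judgments below are hypothetical, and the paper's Model First, Worker First, and Meta Vote strategies are more elaborate than this.

from collections import Counter

def hard_vote(labels):
    """Majority vote over discrete truthfulness labels."""
    return Counter(labels).most_common(1)[0][0]

def soft_vote(scores):
    """Mean of graded truthfulness scores (e.g., on a 0-5 scale)."""
    return sum(scores) / len(scores)

crowd = [4, 5, 3]  # hypothetical worker judgments for one statement
model = [4]        # hypothetical LLM judgment on the same scale
print(hard_vote(crowd + model), soft_vote(crowd + model))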
Barbera, David La; Maddalena, Eddy; Soprano, Michael; Roitero, Kevin; Demartini, Gianluca; Ceolin, Davide; Spina, Damiano; Mizzaro, Stefano
Crowdsourced Fact-checking: Does It Actually Work? Journal Article
In: Information Processing & Management, vol. 61, no 5, pp. 103792, 2024, ISSN: 0306-4573.
@article{BARBERA2024103792b,
title = {Crowdsourced Fact-checking: Does It Actually Work?},
author = {David La Barbera and Eddy Maddalena and Michael Soprano and Kevin Roitero and Gianluca Demartini and Davide Ceolin and Damiano Spina and Stefano Mizzaro},
url = {https://www.sciencedirect.com/science/article/pii/S0306457324001523},
doi = {10.1016/j.ipm.2024.103792},
issn = {0306-4573},
year = {2024},
date = {2024-05-31},
urldate = {2024-05-31},
journal = {Information Processing & Management},
volume = {61},
number = {5},
pages = {103792},
abstract = {There is an important ongoing effort aimed to tackle misinformation and to perform reliable fact-checking by employing human assessors at scale, with a crowdsourcing-based approach. Previous studies on the feasibility of employing crowdsourcing for the task of misinformation detection have provided inconsistent results: some of them seem to confirm the effectiveness of crowdsourcing for assessing the truthfulness of statements and claims, whereas others fail to reach an effectiveness level higher than automatic machine learning approaches, which are still unsatisfactory. In this paper, we aim at addressing such inconsistency and understand if truthfulness assessment can indeed be crowdsourced effectively. To do so, we build on top of previous studies; we select some of those reporting low effectiveness levels, we highlight their potential limitations, and we then reproduce their work attempting to improve their setup to address those limitations. We employ various approaches, data quality levels, and agreement measures to assess the reliability of crowd workers when assessing the truthfulness of (mis)information. Furthermore, we explore different worker features and compare the results obtained with different crowds. According to our findings, crowdsourcing can be used as an effective methodology to tackle misinformation at scale. When compared to previous studies, our results indicate that a significantly higher agreement between crowd workers and experts can be obtained by using a different, higher-quality, crowdsourcing platform and by improving the design of the crowdsourcing task. Also, we find differences concerning task and worker features and how workers provide truthfulness assessments.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Baroni, Giulia Lucrezia; Rasotto, Laura; Roitero, Kevin; Tulisso, Angelica; Loreto, Carla Di; Mea, Vincenzo Della
Optimizing Vision Transformers for Histopathology: Pretraining and Normalization in Breast Cancer Classification Journal Article
In: Journal of Imaging, vol. 10, no 5, 2024, ISSN: 2313-433X.
@article{jimaging10050108,
title = {Optimizing Vision Transformers for Histopathology: Pretraining and Normalization in Breast Cancer Classification},
author = {Giulia Lucrezia Baroni and Laura Rasotto and Kevin Roitero and Angelica Tulisso and Carla Di Loreto and Vincenzo Della Mea},
editor = {MDPI},
url = {https://www.mdpi.com/2313-433X/10/5/108},
doi = {10.3390/jimaging10050108},
issn = {2313-433X},
year = {2024},
date = {2024-05-28},
urldate = {2024-04-30},
journal = {Journal of Imaging},
volume = {10},
number = {5},
abstract = {This paper introduces a self-attention Vision Transformer model specifically developed for classifying breast cancer in histology images. We examine various training strategies and configurations, including pretraining, dimension resizing, data augmentation and color normalization strategies, patch overlap, and patch size configurations, in order to evaluate their impact on the effectiveness of the histology image classification. Additionally, we provide evidence for the increase in effectiveness gathered through geometric and color data augmentation techniques. We primarily utilize the BACH dataset to train and validate our methods and models, but we also test them on two additional datasets, BRACS and AIDPATH, to verify their generalization capabilities. Our model, developed from a transformer pretrained on ImageNet, achieves an accuracy rate of 0.91 on the BACH dataset, 0.74 on the BRACS dataset, and 0.92 on the AIDPATH dataset. Using a model based on the prostate small and prostate medium HistoEncoder models, we achieve accuracy rates of 0.89 and 0.86, respectively. Our results suggest that pretraining on large-scale general datasets like ImageNet is advantageous. We also show the potential benefits of using domain-specific pretraining datasets, such as extensive histopathological image collections as in HistoEncoder, though not yet with clear advantages.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Fiorin, Alessio; Pablo, Carlos López; Lejeune, Marylène; Siraj, Ameer Hamza; Mea, Vincenzo Della
Enhancing AI Research for Breast Cancer: A Comprehensive Review of Tumor-Infiltrating Lymphocyte Datasets Journal Article
In: Journal of Imaging Informatics in Medicine, 2024, ISSN: 2948-2933.
@article{Fiorin2024,
title = {Enhancing AI Research for Breast Cancer: A Comprehensive Review of Tumor-Infiltrating Lymphocyte Datasets},
author = {Alessio Fiorin and Carlos López Pablo and Marylène Lejeune and Ameer Hamza Siraj and Vincenzo Della Mea},
url = {https://doi.org/10.1007/s10278-024-01043-8},
doi = {10.1007/s10278-024-01043-8},
issn = {2948-2933},
year = {2024},
date = {2024-05-01},
journal = {Journal of Imaging Informatics in Medicine},
abstract = {The field of immunology is fundamental to our understanding of the intricate dynamics of the tumor microenvironment. In particular, tumor-infiltrating lymphocyte (TIL) assessment emerges as essential aspect in breast cancer cases. To gain comprehensive insights, the quantification of TILs through computer-assisted pathology (CAP) tools has become a prominent approach, employing advanced artificial intelligence models based on deep learning techniques. The successful recognition of TILs requires the models to be trained, a process that demands access to annotated datasets. Unfortunately, this task is hampered not only by the scarcity of such datasets, but also by the time-consuming nature of the annotation phase required to create them. Our review endeavors to examine publicly accessible datasets pertaining to the TIL domain and thereby become a valuable resource for the TIL community. The overall aim of the present review is thus to make it easier to train and validate current and upcoming CAP tools for TIL assessment by inspecting and evaluating existing publicly available online datasets.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Soprano, Michael; Roitero, Kevin; Barbera, David La; Ceolin, Davide; Spina, Damiano; Demartini, Gianluca; Mizzaro, Stefano
Cognitive Biases in Fact-Checking and Their Countermeasures: A Review Journal Article
In: Information Processing & Management, vol. 61, no 3, pp. 103672, 2024, ISSN: 0306-4573.
@article{SOPRANO2024103672,
title = {Cognitive Biases in Fact-Checking and Their Countermeasures: A Review},
author = {Michael Soprano and Kevin Roitero and David La Barbera and Davide Ceolin and Damiano Spina and Gianluca Demartini and Stefano Mizzaro},
url = {https://www.sciencedirect.com/science/article/pii/S0306457324000323},
doi = {10.1016/j.ipm.2024.103672},
issn = {0306-4573},
year = {2024},
date = {2024-02-11},
urldate = {2024-01-01},
journal = {Information Processing & Management},
volume = {61},
number = {3},
pages = {103672},
abstract = {The increasing amount of misinformation spread online every day is a huge threat to society. Organizations and researchers are working to counter this misinformation plague. In this setting, human assessors are indispensable to correctly identify, assess and/or revise the truthfulness of information items, i.e., to perform the fact-checking activity. Assessors, as humans, are subject to systematic errors that might interfere with their fact-checking activity. Among such errors, cognitive biases are those due to the limits of human cognition. Although biases help to minimize the cost of making mistakes, they skew assessments away from an objective perception of information. Cognitive biases, hence, are particularly frequent and critical, and can cause errors that have a huge potential impact as they propagate not only in the community, but also in the datasets used to train automatic and semi-automatic machine learning models to fight misinformation. In this work, we present a review of the cognitive biases which might occur during the fact-checking process. In more detail, inspired by PRISMA – a methodology used for systematic literature reviews – we manually derive a list of 221 cognitive biases that may affect human assessors. Then, we select the 39 biases that might manifest during the fact-checking process, we group them into categories, and we provide a description. Finally, we present a list of 11 countermeasures that can be adopted by researchers, practitioners, and organizations to limit the effect of the identified cognitive biases on the fact-checking activity.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Baroni, Giulia L.; Rasotto, Laura; Roitero, Kevin; Siraj, Ameer Hamza; Mea, V. Della
Vision Transformers for Breast Cancer Histology Image Classification Proceedings Article
In: Foresti, Gian Luca; Fusiello, Andrea; Hancock, Edwin (Ed.): Image Analysis and Processing - ICIAP 2023 Workshops, pp. 15–26, Springer Nature Switzerland, Cham, 2024, ISBN: 978-3-031-51026-7.
@inproceedings{10.1007/978-3-031-51026-7_2,
title = {Vision Transformers for Breast Cancer Histology Image Classification},
author = {Giulia L. Baroni and Laura Rasotto and Kevin Roitero and Ameer Hamza Siraj and V. Della Mea},
editor = {Gian Luca Foresti and Andrea Fusiello and Edwin Hancock},
doi = {10.1007/978-3-031-51026-7_2},
isbn = {978-3-031-51026-7},
year = {2024},
date = {2024-01-21},
urldate = {2024-01-01},
booktitle = {Image Analysis and Processing - ICIAP 2023 Workshops},
pages = {15–26},
publisher = {Springer Nature Switzerland},
address = {Cham},
abstract = {We propose a self-attention Vision Transformer (ViT) model tailored for breast cancer histology image classification. The proposed architecture uses a stack of transformer layers, with each layer consisting of a multi-head self-attention mechanism and a position-wise feed-forward network, and it is trained with different strategies and configurations, including pretraining, resize dimension, data augmentation, patch overlap, and patch size, to investigate their impact on performance on the histology image classification task. Experimental results show that pretraining on ImageNet and using geometric and color data augmentation techniques significantly improve the model's accuracy on the task. Additionally, a patch size of 16 × 16 and no patch overlap were found to be optimal for this task. These findings provide valuable insights for the design of future ViT-based models for similar image classification tasks.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
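The configuration the abstract converges on (ImageNet pretraining, 16 × 16 patches, no patch overlap) can be sketched with timm, though the library and exact model name are our assumptions rather than the paper's stated implementation.

import timm
import torch

# ImageNet-pretrained ViT with 16x16 patches, fine-tuned for 4-class
# histology classification (the BACH dataset has four classes).
model = timm.create_model("vit_base_patch16_224", pretrained=True, num_classes=4)
x = torch.randn(1, 3, 224, 224)  # one resized histology image
print(model(x).shape)            # torch.Size([1, 4])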
Ros, Francesca Da; Gaspero, Luca Di; Roitero, Kevin; Barbera, David La; Mizzaro, Stefano; Mea, Vincenzo Della; Valent, Francesca; Deroma, Laura
Supporting Fair and Efficient Emergency Medical Services in a Large Heterogeneous Region Journal Article
In: Journal of Healthcare Informatics Research, 2024, ISSN: 2509-498X.
@article{DaRos2024,
title = {Supporting Fair and Efficient Emergency Medical Services in a Large Heterogeneous Region},
author = {Francesca Da Ros and Luca Di Gaspero and Kevin Roitero and David La Barbera and Stefano Mizzaro and Vincenzo Della Mea and Francesca Valent and Laura Deroma},
url = {https://doi.org/10.1007/s41666-023-00154-1},
doi = {10.1007/s41666-023-00154-1},
issn = {2509-498X},
year = {2024},
date = {2024-01-09},
urldate = {2024-01-09},
journal = {Journal of Healthcare Informatics Research},
abstract = {Emergency Medical Services (EMS) are crucial in delivering timely and effective medical care to patients in need. However, the complex and dynamic nature of operations poses challenges for decision-making processes at strategic, tactical, and operational levels. This paper proposes an action-driven strategy for EMS management, employing a multi-objective optimizer and a simulator to evaluate potential outcomes of decisions. The approach combines historical data with dynamic simulations and multi-objective optimization techniques to inform decision-makers and improve the overall performance of the system. The research focuses on the Friuli Venezia Giulia region in north-eastern Italy. The region encompasses various landscapes and demographic situations that challenge fairness and equity in service access. Similar challenges are faced in other regions with comparable characteristics. The Decision Support System developed in this work accurately models the real-world system and provides valuable feedback and suggestions to EMS professionals, enabling them to make informed decisions and enhance the efficiency and fairness of the system.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
2023
Demartini, Gianluca; Roitero, Kevin; Mizzaro, Stefano
Data Bias Management Journal Article
In: Commun. ACM, vol. 67, no 1, pp. 28–32, 2023, ISSN: 0001-0782.
@article{10.1145/3611641,
title = {Data Bias Management},
author = {Gianluca Demartini and Kevin Roitero and Stefano Mizzaro},
url = {https://doi.org/10.1145/3611641},
doi = {10.1145/3611641},
issn = {0001-0782},
year = {2023},
date = {2023-12-21},
urldate = {2023-12-01},
journal = {Commun. ACM},
volume = {67},
number = {1},
pages = {28–32},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
abstract = {Envisioning a unique approach toward bias and fairness research.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Soprano, Michael; Roitero, Kevin; Mea, Vincenzo Della; Mizzaro, Stefano
Towards a Conversational-Based Agent for Health Services Proceedings Article
In: Falchi, Fabrizio; Giannotti, Fosca; Monreale, Anna; Boldrini, Chiara; Rinzivillo, Salvatore; Colantonio, Sara (Ed.): Proceedings of the Italia Intelligenza Artificiale - Thematic Workshops co-located with the 3rd CINI National Lab AIIS Conference on Artificial Intelligence, pp. 278–283, CEUR-WS.org, Pisa, Italy, 2023.
@inproceedings{DBLP:conf/italia2023/Soprano23,
title = {Towards a Conversational-Based Agent for Health Services},
author = {Michael Soprano and Kevin Roitero and Vincenzo Della Mea and Stefano Mizzaro},
editor = {Fabrizio Falchi and Fosca Giannotti and Anna Monreale and Chiara Boldrini and Salvatore Rinzivillo and Sara Colantonio},
url = {https://ceur-ws.org/Vol-3486/96.pdf},
year = {2023},
date = {2023-09-20},
urldate = {2023-01-01},
booktitle = {Proceedings of the Italia Intelligenza Artificiale - Thematic Workshops co-located with the 3rd CINI National Lab AIIS Conference on Artificial Intelligence},
volume = {3486},
pages = {278–283},
publisher = {CEUR-WS.org},
address = {Pisa, Italy},
series = {CEUR Workshop Proceedings},
abstract = {Conversational agents provide new modalities to access and interact with services and applications. Recently, they have seen a resurgence in popularity due to recent advancements in language models. Such agents have been adopted in various fields such as healthcare and education, yet they have received little attention in public administration. As a practical use case, we describe a service of the portal that provides citizens of the Italian region of Friuli-Venezia Giulia with services related to their own Electronic Health Records. The service considered allows them to search for the available doctors and pediatricians in the region's municipalities. We rely on this use case to propose a model for a conversational agent-based access modality. The proposed model lays the foundation for more advanced chatbot-like implementations that will also use alternative input modalities, such as voice-based communication.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Barbera, David La; Soprano, Michael; Roitero, Kevin; Maddalena, Eddy; Mizzaro, Stefano
Fact-Checking at Scale with Crowdsourcing: Experiments and Lessons Learned Proceedings Article
In: Nardini, Franco Maria; Tonelotto, Nicola; Faggioli, Guglielmo; Ferrara, Antonio (Ed.): Proceedings of the 13th Italian Information Retrieval Workshop, pp. 85–90, CEUR-WS.org, Pisa, Italy, 2023.
@inproceedings{DBLP:conf/iir/BarberaSRMM23,
title = {Fact-Checking at Scale with Crowdsourcing: Experiments and Lessons Learned},
author = {David La Barbera and Michael Soprano and Kevin Roitero and Eddy Maddalena and Stefano Mizzaro},
editor = {Franco Maria Nardini and Nicola Tonelotto and Guglielmo Faggioli and Antonio Ferrara},
url = {https://ceur-ws.org/Vol-3448/paper-18.pdf},
year = {2023},
date = {2023-08-26},
urldate = {2023-08-15},
booktitle = {Proceedings of the 13th Italian Information Retrieval Workshop},
volume = {3448},
pages = {85–90},
publisher = {CEUR-WS.org},
address = {Pisa, Italy},
series = {CEUR Workshop Proceedings},
abstract = {In this paper, we present our journey in exploring the use of crowdsourcing for fact-checking. We discuss our early experiments aimed towards the identification of the best possible setting for misinformation assessment using crowdsourcing. Our results indicate that the crowd can effectively address misinformation at scale, showing some degree of correlation with experts. We also highlight the influence of worker background on the quality of truthfulness assessments.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Roitero, Kevin; Barbera, David La; Soprano, Michael; Demartini, Gianluca; Mizzaro, Stefano; Sakai, Tetsuya
How Many Crowd Workers Do I Need? On Statistical Power When Crowdsourcing Relevance Judgments Journal Article
In: ACM Transactions on Information Systems, 2023, ISSN: 1046-8188, (Journal Ranks: Journal Citation Reports (JCR) Q1 (2021), Scimago (SJR) Q1 (2021)).
@article{10.1145/3597201,
title = {How Many Crowd Workers Do I Need? On Statistical Power When Crowdsourcing Relevance Judgments},
author = {Kevin Roitero and David La Barbera and Michael Soprano and Gianluca Demartini and Stefano Mizzaro and Tetsuya Sakai},
doi = {10.1145/3597201},
issn = {1046-8188},
year = {2023},
date = {2023-08-18},
urldate = {2023-01-01},
journal = {ACM Transactions on Information Systems},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
abstract = {To scale the size of Information Retrieval collections, crowdsourcing has become a common way to collect relevance judgments at scale. Crowdsourcing experiments usually employ 100-10,000 workers, but such a number is often decided in a heuristic way. The downside is that the resulting dataset does not have any guarantee of meeting predefined statistical requirements, such as having enough statistical power to distinguish, in a statistically significant way, between the relevance of two documents. We propose a methodology adapted from literature on sound topic set size design, based on t-test and ANOVA, which aims at guaranteeing that the resulting dataset meets a predefined set of statistical requirements. We validate our approach on several public datasets. Our results show that we can reliably estimate the recommended number of workers needed to achieve statistical power, and that such estimation is dependent on the topic, while the effect of the relevance scale is limited. Furthermore, we found that such estimation is dependent on worker features such as agreement. Finally, we describe a set of practical estimation strategies that can be used to estimate the worker set size, and we also provide results on the estimation of document set sizes.},
note = {Journal Ranks: Journal Citation Reports (JCR) Q1 (2021), Scimago (SJR) Q1 (2021)},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
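The underlying machinery, power analysis for a two-sample t-test, can be sketched in a few lines with statsmodels; the effect size and thresholds below are illustrative, and the paper adapts this machinery from topic set size design to worker set sizes.

from statsmodels.stats.power import TTestIndPower

# How many workers per group are needed to detect a medium effect
# (Cohen's d = 0.5) at alpha = 0.05 with 80% power?
n = TTestIndPower().solve_power(effect_size=0.5, alpha=0.05, power=0.8)
print(round(n))  # required sample size per group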
Xie, Haoyu; Maddalena, Eddy; Qarout, Rehab; Checco, Alessandro
The Dark Side of Recruitment in Crowdsourcing: Ethics and Transparency in Micro-Task Marketplaces Journal Article
In: Computer Supported Cooperative Work (CSCW), vol. 32, no 3, pp. 439-474, 2023, ISSN: 1573-7551.
@article{Xie2023b,
title = {The Dark Side of Recruitment in Crowdsourcing: Ethics and Transparency in Micro-Task Marketplaces},
author = {Haoyu Xie and Eddy Maddalena and Rehab Qarout and Alessandro Checco},
url = {https://doi.org/10.1007/s10606-023-09464-9},
doi = {10.1007/s10606-023-09464-9},
issn = {1573-7551},
year = {2023},
date = {2023-07-28},
urldate = {2023-09-01},
journal = {Computer Supported Cooperative Work (CSCW)},
volume = {32},
number = {3},
pages = {439-474},
abstract = {Micro-task crowdsourcing marketplaces like Figure Eight (F8) connect a large pool of workers to employers through a single online platform, by aggregating multiple crowdsourcing platforms (channels) under a unique system. This paper investigates the F8 channels' demographic distribution and reward schemes by analysing more than 53k crowdsourcing tasks over four years, collecting survey data and scraping marketplace metadata. We reveal a heterogeneous per-channel demographic distribution and an opaque channel commission scheme that varies over time and is not communicated to the employer when launching a task: workers will often receive a smaller payment than the employer expects. In addition, the impact of channel commission schemes on the relationship between requesters and crowdworkers is explored. These observations uncover important issues concerning the ethics, reliability, and transparency of crowdsourced experiments when using this kind of marketplace, especially for academic research.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Maddalena, Eddy; Ibáñez, Luis-Daniel; Reeves, Neal; Simperl, Elena
Qrowdsmith: Enhancing Paid Microtask Crowdsourcing with Gamification and Furtherance Incentives Journal Article
In: ACM Trans. Intell. Syst. Technol., 2023, ISSN: 2157-6904, (Just Accepted).
@article{10.1145/3604940,
title = {Qrowdsmith: Enhancing Paid Microtask Crowdsourcing with Gamification and Furtherance Incentives},
author = {Eddy Maddalena and Luis-Daniel Ibáñez and Neal Reeves and Elena Simperl},
url = {https://doi.org/10.1145/3604940},
doi = {10.1145/3604940},
issn = {2157-6904},
year = {2023},
date = {2023-06-01},
journal = {ACM Trans. Intell. Syst. Technol.},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
abstract = {Microtask crowdsourcing platforms are social intelligence systems in which volunteers, called crowdworkers, complete small, repetitive tasks in return for a small fee. Beyond payments, task requesters are considering non-monetary incentives such as points, badges and other gamified elements to increase performance and improve crowdworker experience. In this paper, we present Qrowdsmith, a platform for gamifying microtask crowdsourcing. To design the system, we explore empirically a range of gamified and financial incentives and analyse their impact on how efficient, effective, and reliable the results are. To maintain participation over time and save costs, we propose furtherance incentives, which are offered to crowdworkers to encourage additional contributions in addition to the fee agreed upfront. In a series of controlled experiments we find that while gamification can work as a furtherance incentive, it negatively impacts crowdworkers' performance, in terms of both the quantity and quality of work, compared to a baseline where they can continue to contribute voluntarily. Gamified incentives are also less effective than paid bonus equivalents. Our results contribute to the understanding of how best to encourage engagement in microtask crowdsourcing activities, and design better crowd intelligence systems.},
note = {Just Accepted},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Roitero, Kevin; Martinuzzi, Andrea; Armellin, Maria Teresa; Paparella, Gabriella; Maniero, Alberto; Mea, Vincenzo Della
Automated ICF Coding of Rehabilitation Notes for Low-Resource Languages via Continual Training of Language Models Journal Article
In: Studies in Health Technology and Informatics, vol. 302, pp. 763–767, 2023, ISSN: 1879-8365.
@article{Roitero2023,
title = {Automated ICF Coding of Rehabilitation Notes for Low-Resource Languages via Continual Training of Language Models},
author = {Kevin Roitero and Andrea Martinuzzi and Maria Teresa Armellin and Gabriella Paparella and Alberto Maniero and Vincenzo Della Mea},
editor = {IOS Press},
doi = {10.3233/SHTI230262},
issn = {1879-8365},
year = {2023},
date = {2023-05-18},
urldate = {2023-05-18},
journal = {Studies in Health Technology and Informatics},
volume = {302},
pages = {763–767},
publisher = {IOS Press},
abstract = {The coding of medical documents and in particular of rehabilitation notes using the International Classification of Functioning, Disability and Health (ICF) is a difficult task showing low agreement among experts. Such difficulty is mainly caused by the specific terminology that needs to be used for the task. In this paper, we address the task developing a model based on a large language model, BERT. By leveraging continual training of such a model using ICF textual descriptions, we are able to effectively encode rehabilitation notes expressed in Italian, an under-resourced language.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
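A minimal sketch of the continual-training idea, assuming an Italian BERT checkpoint and the Hugging Face API (the model name and the snippets of ICF text are our assumptions):

from transformers import AutoModelForMaskedLM, AutoTokenizer

# Continue masked-language-model pretraining on ICF textual descriptions,
# then fine-tune the adapted checkpoint for ICF code classification.
tokenizer = AutoTokenizer.from_pretrained("dbmdz/bert-base-italian-cased")
model = AutoModelForMaskedLM.from_pretrained("dbmdz/bert-base-italian-cased")
icf_descriptions = ["b152 Funzioni emozionali", "d450 Camminare"]  # hypothetical snippets
batch = tokenizer(icf_descriptions, padding=True, return_tensors="pt")
# Feed `batch` to a standard MLM loop (e.g., Trainer with
# DataCollatorForLanguageModeling) before the downstream fine-tuning step.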
Ceolin, Davide; Roitero, Kevin; Guo, Furong
Predicting Crowd Workers Performance: An Information Quality Case Proceedings Article
In: Garrigós, Irene; Rodríguez, Juan Manuel Murillo; Wimmer, Manuel (Ed.): Web Engineering, pp. 75–90, Springer Nature Switzerland, Cham, 2023, ISBN: 978-3-031-34444-2.
@inproceedings{10.1007/978-3-031-34444-2_6,
title = {Predicting Crowd Workers Performance: An Information Quality Case},
author = {Davide Ceolin and Kevin Roitero and Furong Guo},
editor = {Irene Garrigós and Juan Manuel Murillo Rodríguez and Manuel Wimmer},
doi = {10.1007/978-3-031-34444-2_6},
isbn = {978-3-031-34444-2},
year = {2023},
date = {2023-01-01},
booktitle = {Web Engineering},
pages = {75–90},
publisher = {Springer Nature Switzerland},
address = {Cham},
abstract = {Supervised machine learning tasks require human-labeled data. Crowdsourcing allows scaling up the labeling process, but the quality of the labels obtained can vary. To address this limitation, we propose methods for predicting label quality based on worker trajectories, i.e., on the sequence of documents workers explore during their crowdsourcing tasks. Trajectories represent a lightweight and non-intrusive form of worker behavior signal. We base our analysis on previously collected datasets composed of thousands of assessment data records including information such as workers' trajectories, workers' assessments, and experts' assessments. We model such behavior sequences as embeddings, to facilitate their management. Then, we: (1) use supervised methods to predict worker performance using a given ground truth; (2) perform an unsupervised analysis to provide insight into crowdsourcing quality when no gold standard is available. We test several supervised approaches which all beat the baseline we propose. Also, we identify significant differences between trajectory clusters in terms of assessments and worker performance. The trajectory-based analysis is a promising direction for non-intrusive worker performance evaluation.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Roitero, Kevin; Portelli, Beatrice; Serra, Giuseppe; Mea, Vincenzo Della; Mizzaro, Stefano; Cerro, Gianni; Vitelli, Michele; Molinara, Mario
Detection of Wastewater Pollution Through Natural Language Generation With a Low-Cost Sensing Platform Journal Article
In: IEEE Access, vol. 11, pp. 50272–50284, 2023, ISSN: 2169-3536.
@article{10129181,
title = {Detection of Wastewater Pollution Through Natural Language Generation With a Low-Cost Sensing Platform},
author = {Kevin Roitero and Beatrice Portelli and Giuseppe Serra and Vincenzo Della Mea and Stefano Mizzaro and Gianni Cerro and Michele Vitelli and Mario Molinara},
doi = {10.1109/ACCESS.2023.3277535},
issn = {2169-3536},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {IEEE Access},
volume = {11},
pages = {50272–50284},
abstract = {The detection of contaminants in several environments (e.g., air, water, sewage systems) is of paramount importance to protect people and predict possible dangerous circumstances. Most works do this using classical Machine Learning tools that act on the acquired measurement data. This paper introduces two main elements: a low-cost platform to acquire, pre-process, and transmit data to classify contaminants in wastewater; and a novel classification approach to classify contaminants in wastewater, based on deep learning and the transformation of raw sensor data into natural language metadata. The proposed solution presents clear advantages against state-of-the-art systems in terms of higher effectiveness and reasonable efficiency. The main disadvantage of the proposed approach is that it relies on knowing the injection time, i.e., the instant in time when the contaminant is injected into the wastewater. For this reason, the developed system also includes a finite state machine tool able to infer the exact time instant when the substance is injected. The entire system is presented and discussed in detail. Furthermore, several variants of the proposed processing technique are also presented to assess the sensitivity to the number of used samples and the corresponding promptness/computational burden of the system. The lowest accuracy obtained by our technique is 91.4%, which is significantly higher than the 81.0% accuracy reached by the best baseline method.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Amigó, Enrique; Deldjoo, Yashar; Mizzaro, Stefano; Bellogín, Alejandro
A unifying and general account of fairness measurement in recommender systems Journal Article
In: Information Processing & Management, vol. 60, no. 1, pp. 103115, 2023, ISSN: 0306-4573.
@article{AMIGO2023103115,
title = {A unifying and general account of fairness measurement in recommender systems},
author = {Enrique Amigó and Yashar Deldjoo and Stefano Mizzaro and Alejandro Bellogín},
url = {https://www.sciencedirect.com/science/article/pii/S0306457322002163},
doi = {10.1016/j.ipm.2022.103115},
issn = {0306-4573},
year = {2023},
date = {2023-01-01},
journal = {Information Processing & Management},
volume = {60},
number = {1},
pages = {103115},
abstract = {Fairness is fundamental to all information access systems, including recommender systems. However, the landscape of fairness definition and measurement is quite scattered, with many competing definitions that are partial and often incompatible. There is much work focusing on specific – and different – notions of fairness and there exist dozens of metrics of fairness in the literature, many of them redundant and most of them incompatible. In contrast, to our knowledge, there is no formal framework that covers all possible variants of fairness and allows developers to choose the most appropriate variant depending on the particular scenario. In this paper, we aim to define a general, flexible, and parameterizable framework that covers a whole range of fairness evaluation possibilities. Instead of modeling the metrics based on an abstract definition of fairness, the distinctive feature of this study compared to the current state of the art is that we start from the metrics applied in the literature to obtain a unified model by generalization. The framework is grounded in a general working hypothesis: interpreting the space of users and items as a probabilistic sample space, two fundamental measures in information theory (Kullback–Leibler Divergence and Mutual Information) can capture the majority of possible scenarios for measuring fairness on recommender system outputs. In addition, earlier research on fairness in recommender systems could be viewed as single-sided, trying to optimize some form of equity across either user groups or provider/procurer groups, without considering the user/item space in conjunction, thereby disregarding the interplay between user and item groups. Instead, our framework includes the notion of statistical independence between user and item groups. We finally validate our approach experimentally on both synthetic and real data according to a wide range of state-of-the-art recommendation algorithms and real-world data sets, showing that with our framework we can measure fairness in a general, uniform, and meaningful way.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Amigó, Enrique; Gonzalo, Julio; Mizzaro, Stefano
What is My Problem? Identifying Formal Tasks and Metrics in Data Mining on the Basis of Measurement Theory Journal Article
In: IEEE Transactions on Knowledge and Data Engineering, vol. 35, no. 2, pp. 2147–2157, 2023.
@article{9528028,
title = {What is My Problem? Identifying Formal Tasks and Metrics in Data Mining on the Basis of Measurement Theory},
author = {Enrique Amigó and Julio Gonzalo and Stefano Mizzaro},
doi = {10.1109/TKDE.2021.3109823},
year = {2023},
date = {2023-01-01},
journal = {IEEE Transactions on Knowledge and Data Engineering},
volume = {35},
number = {2},
pages = {2147–2157},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
2022
Brand, Erik; Roitero, Kevin; Soprano, Michael; Rahimi, Afshin; Demartini, Gianluca
A Neural Model to Jointly Predict and Explain Truthfulness of Statements Journal Article
In: J. Data and Information Quality, 2022, ISSN: 1936-1955, (Just Accepted).
@article{10.1145/3546917,
title = {A Neural Model to Jointly Predict and Explain Truthfulness of Statements},
author = {Erik Brand and Kevin Roitero and Michael Soprano and Afshin Rahimi and Gianluca Demartini},
url = {https://doi.org/10.1145/3546917},
doi = {10.1145/3546917},
issn = {1936-1955},
year = {2022},
date = {2022-05-01},
journal = {J. Data and Information Quality},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
abstract = {Automated fact-checking (AFC) systems exist to combat disinformation; however, their complexity usually makes them opaque to the end user, making it difficult to foster trust in the system. In this paper, we introduce the E-BART model with the hope of making progress on this front. E-BART is able to provide a veracity prediction for a claim, and jointly generate a human-readable explanation for this decision. We show that E-BART is competitive with the state-of-the-art on the e-FEVER and e-SNLI tasks. In addition, we validate the joint-prediction architecture by showing 1) that generating explanations does not significantly impede the model from performing well in its main task of veracity prediction, and 2) that predicted veracity and explanations are more internally coherent when generated jointly than separately. We also calibrate the E-BART model, allowing the output of the final model to be correctly interpreted as the confidence of correctness. Finally, we also conduct an extensive human evaluation on the impact of generated explanations and observe that explanations increase human ability to spot misinformation and make people more skeptical about claims, and that explanations generated by E-BART are competitive with ground truth explanations.},
note = {Just Accepted},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Qu, Yunke; Barbera, David La; Roitero, Kevin; Mizzaro, Stefano; Spina, Damiano; Demartini, Gianluca
Combining Human and Machine Confidence in Truthfulness Assessment Journal Article
In: J. Data and Information Quality, 2022, ISSN: 1936-1955, (Just Accepted).
@article{10.1145/3546916,
title = {Combining Human and Machine Confidence in Truthfulness Assessment},
author = {Yunke Qu and David La Barbera and Kevin Roitero and Stefano Mizzaro and Damiano Spina and Gianluca Demartini},
url = {https://doi.org/10.1145/3546916},
doi = {10.1145/3546916},
issn = {1936-1955},
year = {2022},
date = {2022-05-01},
journal = {J. Data and Information Quality},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
abstract = {Automatically detecting online misinformation at scale is a challenging and interdisciplinary problem. Deciding what is to be considered truthful information is sometimes controversial and difficult also for educated experts. As the scale of the problem increases, human-in-the-loop approaches to truthfulness that combine both the scalability of machine learning (ML) and the accuracy of human contributions have been considered. In this work we look at the potential to automatically combine machine-based systems with human-based systems. The former exploit supervised ML approaches; the latter involve either crowd workers (i.e., human non-experts) or human experts. Since both ML and crowdsourcing approaches can produce a score indicating the level of confidence on their truthfulness judgments (either algorithmic or self-reported, respectively), we address the question of whether it is feasible to make use of such confidence scores to effectively and efficiently combine three approaches: (i) machine-based methods; (ii) crowd workers, and (iii) human experts. The three approaches differ significantly as they range from available, cheap, fast, scalable, but less accurate to scarce, expensive, slow, not scalable, but highly accurate.},
note = {Just Accepted},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Soprano, Michael; Roitero, Kevin; Bona, Francesco Bombassei De; Mizzaro, Stefano
Crowd_Frame: A Simple and Complete Framework to Deploy Complex Crowdsourcing Tasks Off-the-Shelf Proceedings Article
In: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining, pp. 1605–1608, Association for Computing Machinery, Virtual Event, AZ, USA, 2022, ISBN: 9781450391320.
@inproceedings{conference-paper-wsdm2022,
title = {Crowd_Frame: A Simple and Complete Framework to Deploy Complex Crowdsourcing Tasks Off-the-Shelf},
author = {Michael Soprano and Kevin Roitero and Francesco Bombassei De Bona and Stefano Mizzaro},
doi = {10.1145/3488560.3502182},
isbn = {9781450391320},
year = {2022},
date = {2022-01-01},
booktitle = {Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining},
pages = {1605–1608},
publisher = {Association for Computing Machinery},
address = {Virtual Event, AZ, USA},
series = {WSDM '22},
abstract = {Due to their relatively low cost and ability to scale, crowdsourcing-based approaches are widely used to collect a large amount of human-annotated data. To this aim, multiple crowdsourcing platforms exist, where requesters can upload tasks and workers can carry them out and obtain payment in return. Such platforms share a task design and deployment workflow that is often counter-intuitive and cumbersome. To address this issue, we propose Crowd_Frame, a simple and complete framework which allows one to develop and deploy diverse types of complex crowdsourcing tasks in an easy and customizable way. We show the abilities of the proposed framework and we make it available to researchers and practitioners.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Roitero, Kevin; Checco, Alessandro; Mizzaro, Stefano; Demartini, Gianluca
Preferences on a Budget: Prioritizing Document Pairs When Crowdsourcing Relevance Judgments Proceedings Article
In: Proceedings of the ACM Web Conference 2022, pp. 319–327, Association for Computing Machinery, Virtual Event, Lyon, France, 2022, ISBN: 9781450390965.
@inproceedings{10.1145/3485447.3511960,
title = {Preferences on a Budget: Prioritizing Document Pairs When Crowdsourcing Relevance Judgments},
author = {Kevin Roitero and Alessandro Checco and Stefano Mizzaro and Gianluca Demartini},
url = {https://doi.org/10.1145/3485447.3511960},
doi = {10.1145/3485447.3511960},
isbn = {9781450390965},
year = {2022},
date = {2022-01-01},
booktitle = {Proceedings of the ACM Web Conference 2022},
pages = {319–327},
publisher = {Association for Computing Machinery},
address = {Virtual Event, Lyon, France},
series = {WWW '22},
abstract = {In Information Retrieval (IR) evaluation, preference judgments are collected by presenting to the assessors a pair of documents and asking them to select which of the two, if any, is the most relevant. This is an alternative to the classic relevance judgment approach, in which human assessors judge the relevance of a single document on a scale; such an alternative allows assessors to make relative rather than absolute judgments of relevance. While preference judgments are easier for human assessors to perform, the number of possible document pairs to be judged is usually so high that it is unfeasible to judge them all. Thus, following a similar idea to pooling strategies for single document relevance judgments, where the goal is to sample the most useful documents to be judged, in this work we focus on analyzing alternative ways to sample document pairs to judge, in order to maximize the value of a fixed number of preference judgments that can feasibly be collected. Such value is defined as how well we can evaluate IR systems given a budget, that is, a fixed number of human preference judgments that may be collected. By relying on several datasets featuring relevance judgments gathered by means of experts and crowdsourcing, we experimentally compare alternative strategies to select document pairs and show how different strategies lead to different IR evaluation result quality levels. Our results show that, by using the appropriate procedure, it is possible to achieve good IR evaluation results with a limited number of preference judgments, thus confirming the feasibility of using preference judgments to create IR evaluation collections.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Barbera, David La; Roitero, Kevin; Mackenzie, Joel; Spina, Damiano; Demartini, Gianluca; Mizzaro, Stefano
BUM at CheckThat! 2022: A Composite Deep Learning Approach to Fake News Detection using Evidence Retrieval Proceedings Article
In: Faggioli, Guglielmo; Ferro, Nicola; Hanbury, Allan; Potthast, Martin (Ed.): Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum, Bologna, Italy, 2022.
@inproceedings{clef-checkthat:2022:task3:La_Barbera_BUM,
title = {BUM at CheckThat! 2022: A Composite Deep Learning Approach to Fake News Detection using Evidence Retrieval},
author = {David La Barbera and Kevin Roitero and Joel Mackenzie and Damiano Spina and Gianluca Demartini and Stefano Mizzaro},
editor = {Guglielmo Faggioli and Nicola Ferro and Allan Hanbury and Martin Potthast},
year = {2022},
date = {2022-01-01},
booktitle = {Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum},
address = {Bologna, Italy},
series = {CLEF 2022},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Draws, Tim; Barbera, David La; Soprano, Michael; Roitero, Kevin; Ceolin, Davide; Checco, Alessandro; Mizzaro, Stefano
The Effects of Crowd Worker Biases in Fact-Checking Tasks Proceedings Article
In: 2022 ACM Conference on Fairness, Accountability, and Transparency, pp. 2114–2124, Association for Computing Machinery, Seoul, Republic of Korea, 2022, ISBN: 9781450393522.
@inproceedings{10.1145/3531146.3534629,
title = {The Effects of Crowd Worker Biases in Fact-Checking Tasks},
author = {Tim Draws and David La Barbera and Michael Soprano and Kevin Roitero and Davide Ceolin and Alessandro Checco and Stefano Mizzaro},
url = {https://doi.org/10.1145/3531146.3534629},
doi = {10.1145/3531146.3534629},
isbn = {9781450393522},
year = {2022},
date = {2022-01-01},
booktitle = {2022 ACM Conference on Fairness, Accountability, and Transparency},
pages = {2114–2124},
publisher = {Association for Computing Machinery},
address = {Seoul, Republic of Korea},
series = {FAccT '22},
abstract = {Due to the increasing amount of information shared online every day, the need for sound and reliable ways of distinguishing between trustworthy and non-trustworthy information is as present as ever. One technique for performing fact-checking at scale is to employ human intelligence in the form of crowd workers. Although earlier work has suggested that crowd workers can reliably identify misinformation, cognitive biases of crowd workers may reduce the quality of truthfulness judgments in this context. We performed a systematic exploratory analysis of publicly available crowdsourced data to identify a set of potential systematic biases that may occur when crowd workers perform fact-checking tasks. Following this exploratory study, we collected a novel data set of crowdsourced truthfulness judgments to validate our hypotheses. Our findings suggest that workers generally overestimate the truthfulness of statements and that different individual characteristics (i.e., their belief in science) and cognitive biases (i.e., the affect heuristic and overconfidence) can affect their annotations. Interestingly, we find that, depending on the general judgment tendencies of workers, their biases may sometimes lead to more accurate judgments.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Ceschia, Sara; Roitero, Kevin; Demartini, Gianluca; Mizzaro, Stefano; Gaspero, Luca Di; Schaerf, Andrea
Task design in complex crowdsourcing experiments: Item assignment optimization Journal Article
In: Computers & Operations Research, pp. 105995, 2022, ISSN: 0305-0548.
@article{CESCHIA2022105995,
title = {Task design in complex crowdsourcing experiments: Item assignment optimization},
author = {Sara Ceschia and Kevin Roitero and Gianluca Demartini and Stefano Mizzaro and Luca Di Gaspero and Andrea Schaerf},
url = {https://www.sciencedirect.com/science/article/pii/S0305054822002295},
doi = {10.1016/j.cor.2022.105995},
issn = {0305-0548},
year = {2022},
date = {2022-01-01},
journal = {Computers & Operations Research},
pages = {105995},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Ceolin, Davide; Primiero, Giuseppe; Soprano, Michael; Wielemaker, Jan
Transparent Assessment of Information Quality of Online Reviews Using Formal Argumentation Theory Journal Article
In: Information Systems, vol. 110, pp. 102107, 2022, ISSN: 0306-4379, (Journal Ranks: Journal Citation Reports (JCR) Q2 (2021), Scimago (SJR) Q1 (2021)).
@article{CEOLIN2022102107,
title = {Transparent Assessment of Information Quality of Online Reviews Using Formal Argumentation Theory},
author = {Davide Ceolin and Giuseppe Primiero and Michael Soprano and Jan Wielemaker},
doi = {10.1016/j.is.2022.102107},
issn = {0306-4379},
year = {2022},
date = {2022-01-01},
journal = {Information Systems},
volume = {110},
pages = {102107},
abstract = {Review scores collect users’ opinions in a simple and intuitive manner. However, review scores are also easily manipulable, hence they are often accompanied by explanations. A substantial amount of research has been devoted to ascertaining the quality of reviews, to identify the most useful and authentic scores through explanation analysis. In this paper, we advance the state of the art in review quality analysis. We introduce a rating system to identify review arguments and to define an appropriate weighted semantics through formal argumentation theory. We introduce an algorithm to construct a corresponding graph, based on a selection of weighted arguments, their semantic distance, and the supported ratings. We also provide an algorithm to identify the model of such an argumentation graph, maximizing the overall weight of the admitted nodes and edges. We evaluate these contributions on the Amazon review dataset by McAuley et al. (2015), by comparing the results of our argumentation assessment with the upvotes received by the reviews. Also, we deepen the evaluation by crowdsourcing a multidimensional assessment of reviews and comparing it to the argumentation assessment. Lastly, we perform a user study to evaluate the explainability of our method, i.e., to test whether the automated method we use to assess reviews is understandable by humans. Our method achieves two goals: (1) it identifies reviews that are considered useful, comprehensible, and complete by online users, and does so in an unsupervised manner, and (2) it provides an explanation of quality assessments.},
note = {Journal Ranks: Journal Citation Reports (JCR) Q2 (2021), Scimago (SJR) Q1 (2021)},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Amigó, Enrique; Mizzaro, Stefano; Spina, Damiano
Ranking Interruptus: When Truncated Rankings Are Better and How to Measure That Proceedings Article
In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 588–598, Association for Computing Machinery, New York, NY, USA, 2022, ISBN: 9781450387323.
@inproceedings{10.1145/3477495.3532051,
title = {Ranking Interruptus: When Truncated Rankings Are Better and How to Measure That},
author = {Enrique Amigó and Stefano Mizzaro and Damiano Spina},
url = {https://doi.org/10.1145/3477495.3532051},
doi = {10.1145/3477495.3532051},
isbn = {9781450387323},
year = {2022},
date = {2022-01-01},
booktitle = {Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval},
pages = {588–598},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
series = {SIGIR '22},
abstract = {Most information retrieval effectiveness evaluation metrics assume that systems appending irrelevant documents at the bottom of the ranking are as effective as (or not worse than) systems that have a stopping criterion to 'truncate' the ranking at the right position to avoid retrieving those irrelevant documents at the end. It can be argued, however, that such truncated rankings are more useful to the end user. It is thus important to understand how to measure retrieval effectiveness in this scenario. In this paper we provide both theoretical and experimental contributions. We first define formal properties to analyze how effectiveness metrics behave when evaluating truncated rankings. Our theoretical analysis shows that de-facto standard metrics do not satisfy desirable properties to evaluate truncated rankings: only Observational Information Effectiveness (OIE) – a metric based on Shannon's information theory – satisfies them all. We then perform experiments to compare several metrics on nine TREC datasets. According to our experimental results, the most appropriate metrics for truncated rankings are OIE and a novel extension of Rank-Biased Precision that adds a user effort factor penalizing the retrieval of irrelevant documents.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
2021
Brand, Erik; Roitero, Kevin; Soprano, Michael; Demartini, Gianluca
E-BART: Jointly Predicting and Explaining Truthfulness Proceedings Article
In: Augenstein, Isabelle; Papotti, Paolo; Wright, Dustin (Ed.): Proceedings of the 2021 Truth and Trust Online Conference (TTO 2021), Virtual, October 7-8, 2021, pp. 18–27, Hacks Hackers, 2021.
@inproceedings{conference-paper-tto-2021,
title = {E-BART: Jointly Predicting and Explaining Truthfulness},
author = {Erik Brand and Kevin Roitero and Michael Soprano and Gianluca Demartini},
editor = {Isabelle Augenstein and Paolo Papotti and Dustin Wright},
url = {https://truthandtrustonline.com/wp-content/uploads/2021/10/TTO2021_paper_16-1.pdf},
year = {2021},
date = {2021-01-01},
booktitle = {Proceedings of the 2021 Truth and Trust Online Conference (TTO 2021), Virtual, October 7-8, 2021},
pages = {18--27},
publisher = {Hacks Hackers},
abstract = {Automated fact-checking (AFC) systems exist to combat disinformation; however, their complexity makes them opaque to the end user, making it difficult to foster trust. In this paper, we introduce the E-BART model with the hope of making progress on this front. E-BART is able to provide a veracity prediction for a claim, and jointly generate a human-readable explanation for this decision. We show that E-BART is competitive with the state-of-the-art on the e-FEVER and e-SNLI tasks. In addition, we validate the joint-prediction architecture by showing 1) that generating explanations does not significantly impede the model from performing well in its main task of veracity prediction, and 2) that predicted veracity and explanations are more internally coherent when generated jointly than separately. Finally, we also conduct human evaluations on the impact of generated explanations and observe that explanations increase human ability to spot misinformation and make people more skeptical about claims.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Roitero, Kevin; Soprano, Michael; Portelli, Beatrice; Luise, Massimiliano De; Spina, Damiano; Mea, Vincenzo Della; Serra, Giuseppe; Mizzaro, Stefano; Demartini, Gianluca
Can The Crowd Judge Truthfulness? A Longitudinal Study on Recent Misinformation About COVID-19 Journal Article
In: Personal and Ubiquitous Computing, 2021, ISSN: 1617-4917.
@article{journal-paper-puc-2021,
title = {Can The Crowd Judge Truthfulness? A Longitudinal Study on Recent Misinformation About COVID-19},
author = {Kevin Roitero and Michael Soprano and Beatrice Portelli and Massimiliano De Luise and Damiano Spina and Vincenzo Della Mea and Giuseppe Serra and Stefano Mizzaro and Gianluca Demartini},
url = {https://doi.org/10.1007/s00779-021-01604-6},
doi = {10.1007/s00779-021-01604-6},
issn = {1617-4917},
year = {2021},
date = {2021-01-01},
journal = {Personal and Ubiquitous Computing},
abstract = {Recently, the misinformation problem has been addressed with a crowdsourcing-based approach: to assess the truthfulness of a statement, instead of relying on a few experts, a crowd of non-experts is exploited. We study whether crowdsourcing is an effective and reliable method to assess truthfulness during a pandemic, targeting statements related to COVID-19, thus addressing (mis)information that is both related to a sensitive and personal issue and very recent as compared to when the judgment is done. In our experiments, crowd workers are asked to assess the truthfulness of statements, and to provide evidence for the assessments. Besides showing that the crowd is able to accurately judge the truthfulness of the statements, we report results on workers' behavior, agreement among workers, and the effect of aggregation functions, of scale transformations, and of workers' background and bias. We perform a longitudinal study by re-launching the task multiple times with both novice and experienced workers, deriving important insights on how the behavior and quality change over time. Our results show that workers are able to detect and objectively categorize online (mis)information related to COVID-19; both crowdsourced and expert judgments can be transformed and aggregated to improve quality; worker background and other signals (e.g., source of information, behavior) impact the quality of the data. The longitudinal study demonstrates that the time-span has a major effect on the quality of the judgments, for both novice and experienced workers. Finally, we provide an extensive failure analysis of the statements misjudged by the crowd workers.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Soprano, Michael; Roitero, Kevin; Barbera, David La; Ceolin, Davide; Spina, Damiano; Mizzaro, Stefano; Demartini, Gianluca
The Many Dimensions of Truthfulness: Crowdsourcing Misinformation Assessments on a Multidimensional Scale Journal Article
In: Information Processing & Management, vol. 58, no. 6, pp. 102710, 2021, ISSN: 0306-4573.
@article{journal-paper-ipm-2021,
title = {The Many Dimensions of Truthfulness: Crowdsourcing Misinformation Assessments on a Multidimensional Scale},
author = {Michael Soprano and Kevin Roitero and David La Barbera and Davide Ceolin and Damiano Spina and Stefano Mizzaro and Gianluca Demartini},
url = {https://www.sciencedirect.com/science/article/pii/S0306457321001941},
doi = {10.1016/j.ipm.2021.102710},
issn = {0306-4573},
year = {2021},
date = {2021-01-01},
journal = {Information Processing & Management},
volume = {58},
number = {6},
pages = {102710},
abstract = {Recent work has demonstrated the viability of using crowdsourcing as a tool for evaluating the truthfulness of public statements. Under certain conditions such as: (1) having a balanced set of workers with different backgrounds and cognitive abilities; (2) using an adequate set of mechanisms to control the quality of the collected data; and (3) using a coarse grained assessment scale, the crowd can provide reliable identification of fake news. However, fake news are a subtle matter: statements can be just biased (“cherrypicked”), imprecise, wrong, etc. and the unidimensional truth scale used in existing work cannot account for such differences. In this paper we propose a multidimensional notion of truthfulness and we ask the crowd workers to assess seven different dimensions of truthfulness selected based on existing literature: Correctness, Neutrality, Comprehensibility, Precision, Completeness, Speaker’s Trustworthiness, and Informativeness. We deploy a set of quality control mechanisms to ensure that the thousands of assessments collected on 180 publicly available fact-checked statements distributed over two datasets are of adequate quality, including a custom search engine used by the crowd workers to find web pages supporting their truthfulness assessments. A comprehensive analysis of crowdsourced judgments shows that: (1) the crowdsourced assessments are reliable when compared to an expert-provided gold standard; (2) the proposed dimensions of truthfulness capture independent pieces of information; (3) the crowdsourcing task can be easily learned by the workers; and (4) the resulting assessments provide a useful basis for a more complete estimation of statement truthfulness.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Ceolin, Davide; Primiero, Giuseppe; Wielemaker, Jan; Soprano, Michael
Assessing the Quality of Online Reviews Using Formal Argumentation Theory Proceedings Article
In: Brambilla, Marco; Chbeir, Richard; Frasincar, Flavius; Manolescu, Ioana (Ed.): Web Engineering, pp. 71–87, Springer International Publishing, Cham, 2021, ISBN: 978-3-030-74296-6.
@inproceedings{10.1007/978-3-030-74296-6_6,
title = {Assessing the Quality of Online Reviews Using Formal Argumentation Theory},
author = {Davide Ceolin and Giuseppe Primiero and Jan Wielemaker and Michael Soprano},
editor = {Marco Brambilla and Richard Chbeir and Flavius Frasincar and Ioana Manolescu},
doi = {10.1007/978-3-030-74296-6_6},
isbn = {978-3-030-74296-6},
year = {2021},
date = {2021-01-01},
booktitle = {Web Engineering},
pages = {71--87},
publisher = {Springer International Publishing},
address = {Cham},
abstract = {Review scores collect users' opinions in a simple and intuitive manner. However, review scores are also easily manipulable, hence they are often accompanied by explanations. A substantial amount of research has been devoted to ascertaining the quality of reviews, to identify the most useful and authentic scores through explanation analysis. In this paper, we advance the state of the art in review quality analysis. We introduce a rating system to identify review arguments and to define an appropriate weighted semantics through formal argumentation theory. We introduce an algorithm to construct a corresponding graph, based on a selection of weighted arguments, their semantic similarity, and the supported ratings. We provide an algorithm to identify the model of such an argumentation graph, maximizing the overall weight of the admitted nodes and edges. We evaluate these contributions on the Amazon review dataset by McAuley et al. [15], by comparing the results of our argumentation assessment with the upvotes received by the reviews. Also, we deepen the evaluation by crowdsourcing a multidimensional assessment of reviews and comparing it to the argumentation assessment. Lastly, we perform a user study to evaluate the explainability of our method. Our method achieves two goals: (1) it identifies reviews that are considered useful, comprehensible, and truthful by online users, and does so in an unsupervised manner, and (2) it provides an explanation of quality assessments.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Qu, Yunke; Roitero, Kevin; Mizzaro, Stefano; Spina, Damiano; Demartini, Gianluca
Human-in-the-Loop Systems for Truthfulness: A Study of Human and Machine Confidence Proceedings Article
In: Augenstein, Isabelle; Papotti, Paolo; Wright, Dustin (Ed.): Proceedings of the 2021 Truth and Trust Online Conference (TTO 2021), Virtual, October 7-8, 2021, pp. 40–49, Hacks Hackers, 2021.
@inproceedings{DBLP:conf/tto/QuRMSD21,
title = {Human-in-the-Loop Systems for Truthfulness: A Study of Human and Machine Confidence},
author = {Yunke Qu and Kevin Roitero and Stefano Mizzaro and Damiano Spina and Gianluca Demartini},
editor = {Isabelle Augenstein and Paolo Papotti and Dustin Wright},
url = {https://truthandtrustonline.com/wp-content/uploads/2021/10/TTO2021_paper_29.pdf},
year = {2021},
date = {2021-01-01},
booktitle = {Proceedings of the 2021 Truth and Trust Online Conference (TTO 2021), Virtual, October 7-8, 2021},
pages = {40--49},
publisher = {Hacks Hackers},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Roitero, Kevin; Portelli, Beatrice; Popescu, Mihai Horia; Mea, Vincenzo Della
DiLBERT: Cheap Embeddings for Disease Related Medical NLP Journal Article
In: IEEE Access, vol. 9, pp. 159714–159723, 2021.
@article{9628010,
title = {DiLBERT: Cheap Embeddings for Disease Related Medical NLP},
author = {Kevin Roitero and Beatrice Portelli and Mihai Horia Popescu and Vincenzo Della Mea},
doi = {10.1109/ACCESS.2021.3131386},
year = {2021},
date = {2021-01-01},
journal = {IEEE Access},
volume = {9},
pages = {159714–159723},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Demartini, Gianluca; Roitero, Kevin; Mizzaro, Stefano
Managing Bias in Human-Annotated Data: Moving Beyond Bias Removal Journal Article
In: CoRR, vol. abs/2110.13504, 2021.
@article{DBLP:journals/corr/abs-2110-13504,
title = {Managing Bias in Human-Annotated Data: Moving Beyond Bias Removal},
author = {Gianluca Demartini and Kevin Roitero and Stefano Mizzaro},
url = {https://arxiv.org/abs/2110.13504},
year = {2021},
date = {2021-01-01},
journal = {CoRR},
volume = {abs/2110.13504},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Conde-Sousa, Eduardo; Vale, João; Feng, Ming; Xu, Kele; Wang, Yin; Mea, Vincenzo Della; Barbera, David La; Montahaei, Ehsan; Baghshah, Mahdieh Soleymani; Turzynski, Andreas; Gildenblat, Jacob; Klaiman, Eldad; Hong, Yiyu; Aresta, Guilherme; Araújo, Teresa; Aguiar, Paulo; Eloy, Catarina; Polónia, António
HEROHE Challenge: assessing HER2 status in breast cancer without immunohistochemistry or in situ hybridization Miscellaneous
2021.
@misc{https://doi.org/10.48550/arxiv.2111.04738,
title = {HEROHE Challenge: assessing HER2 status in breast cancer without immunohistochemistry or in situ hybridization},
author = {Eduardo Conde-Sousa and João Vale and Ming Feng and Kele Xu and Yin Wang and Vincenzo Della Mea and David La Barbera and Ehsan Montahaei and Mahdieh Soleymani Baghshah and Andreas Turzynski and Jacob Gildenblat and Eldad Klaiman and Yiyu Hong and Guilherme Aresta and Teresa Araújo and Paulo Aguiar and Catarina Eloy and António Polónia},
url = {https://arxiv.org/abs/2111.04738},
doi = {10.48550/ARXIV.2111.04738},
year = {2021},
date = {2021-01-01},
publisher = {arXiv},
keywords = {},
pubstate = {published},
tppubtype = {misc}
}
Barbera, David La; Roitero, Kevin; Mizzaro, Stefano; Mea, Vincenzo Della; Valent, Francesca
A Software Simulator for Optimizing Ambulance Location and Response Time: A Preliminary Report Proceedings Article
In: 2021 IEEE International Conference on Digital Health (ICDH), pp. 209–211, 2021.
@inproceedings{9581242,
title = {A Software Simulator for Optimizing Ambulance Location and Response Time: A Preliminary Report},
author = {David La Barbera and Kevin Roitero and Stefano Mizzaro and Vincenzo Della Mea and Francesca Valent},
doi = {10.1109/ICDH52753.2021.00037},
year = {2021},
date = {2021-01-01},
booktitle = {2021 IEEE International Conference on Digital Health (ICDH)},
pages = {209–211},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
2020
Barbera, David La; Polónia, António; Roitero, Kevin; Conde-Sousa, Eduardo; Mea, Vincenzo Della
Detection of HER2 from Haematoxylin-Eosin Slides Through a Cascade of Deep Learning Classifiers via Multi-Instance Learning Journal Article
In: Journal of Imaging, vol. 6, no. 9, 2020, ISSN: 2313-433X.
@article{labarberaher2,
title = {Detection of HER2 from Haematoxylin-Eosin Slides Through a Cascade of Deep Learning Classifiers via Multi-Instance Learning},
author = {David La Barbera and António Polónia and Kevin Roitero and Eduardo Conde-Sousa and Vincenzo Della Mea},
doi = {10.3390/jimaging6090082},
issn = {2313-433X},
year = {2020},
date = {2020-08-23},
urldate = {2020-08-23},
journal = {Journal of Imaging},
volume = {6},
number = {9},
abstract = {Breast cancer is the most frequently diagnosed cancer in women. The correct identification of the HER2 receptor is a matter of major importance when dealing with breast cancer: an over-expression of HER2 is associated with aggressive clinical behaviour; moreover, HER2-targeted therapy results in a significant improvement in the overall survival rate. In this work, we employ a pipeline based on a cascade of deep neural network classifiers and multi-instance learning to detect the presence of HER2 from Haematoxylin-Eosin slides, which partly mimics the pathologist’s behaviour by first recognizing cancer and then evaluating HER2. Our results show that the proposed system achieves good overall effectiveness. Furthermore, the system design is open to further improvements that can be easily deployed in order to increase the effectiveness score.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}