SIGEcom Exchanges Annotated Reading List: Multiclass Calibration

doi:10.1145/3817099.3817107

DOI: 10.1145/3817099.3817107 ISSN: 1551-9031

SIGEcom Exchanges Annotated Reading List: Multiclass Calibration

Rabanus Derr, Jessie Finocchiaro

ML model evaluation often takes one of two main approaches: risk minimization , associated with "high accuracy" or calibration , meaning that predictions are "trustworthy" and can be interpreted from a probabilistic lens. There is an extensive line of work which has studied the relationship between risk minimization and calibration, mostly focusing on the binary outcome setting. Even in the binary setting, there are a variety of proposed calibration metrics which non-trivially interact. In the multiclass label setting, the choices to be made are even more complex and particularly there are different semantics for different notions. Here, we briefly present an annotated reading list reviewing some of the proposed definitions and their relationships.

More from our Archive

DOI: 10.1242/jeb.252227 2026
When repair mechanisms fail to keep up: high UVB irradiance causes disproportionate accumulation of DNA lesions
Niclas U. Lundsgaard, Craig E. Franklin, Rebecca L. Cramp
DOI: 10.1177/23996544261466050 2026
Writing against erasure: A geography of resistance in Gaza
Lubna Ahmad Abu Sitta
DOI: 10.1148/rg.250085 2026
Early Pancreatic Cancer: Clinical Implications, Workup, and Imaging Findings with Histopathologic Correlation for Personalized Surveillance
Shintaro Kano, Wataru Gonoi, Moto Nakaya, Shohei Inui, Yudai Nakai, Sota Masuoka, Tomohiko Masumoto, Manabu Minami, Ayman H. Gaballah, Osamu Abe
DOI: 10.1017/pds.2026.10666 2026
Challenges in understanding, using, and teaching design methods: perspectives of design educators
Mayank Mayookh, V. Srinivasan
DOI: 10.1136/bmj-2026-100016 2026
Venous thromboembolism after mechanical restraint in psychiatric hospitals: population based cohort and self-controlled case series study
Jakob Hansen Viuff, Lars Pedersen, Irene Petersen, Jan P Vandenbroucke, Søren Dinesen Østergaard, Henrik Toft Sørensen
DOI: 10.1097/olq.0000000000002356 2026
The Potential for Combined Treponemal/Nontreponemal Rapid Point-of-Care Test and Treponema pallidum Polymerase Chain Reaction in the Diagnosis of Gestational and Congenital Syphilis in a Low-Resource, High-Prevalence Setting: Pilot Data From Malawi
Deirdre J Foley, Vita Nyasulu, Chifundo Kondoni, Annie Kuyere, Fatima Mtonga, George Shaba, James Jafali, Chelsea Morroni, Michael Marks, Patrick Mallon, David Lissauer, Gladys Gadama, Luis Gadama, Kondwani Kawaza, Charlotte van der Veer, Bridget Freyne
DOI: 10.1097/olq.0000000000002347 2026
Advancements in Syphilis Vaccine Development
Lorenzo Giacani, Caroline E. Cameron, Feijun Zhao, Melissa J. Caimano, Justin D. Radolf
DOI: 10.1097/olq.0000000000002353 2026
Evaluation of Partner Notification Strategies to Improve Syphilis Management in Pregnancy in Blantyre, Malawi: A Mixed-Methods Study
Kondwani Kaitume Kaunda, Deirdre J. Foley, Michael Marks, Annielisa Majamanda, Monica Patricia Malata, Catherine Bamuya, Chifundo Kondoni, Gladys Membe Gadama, David Lissauer, Chelsea Morroni, Peter MacPherson, Effie Chipeta, Linda Mipando, Brynne Gilmore, Bridget Freyne
DOI: 10.1097/olq.0000000000002331 2026
Potential Strategies for Participation and Community Engagement in Syphilis Clinical Research
Mitch M. Matoga, Suzanne Day, Dan Wu, Zhuoheng Yin, Bolin Cao, Zou Huachun, Barbara Van Der Pol, Joseph D. Tucker
DOI: 10.1097/olq.0000000000002349 2026
Key Considerations in Evaluating Syphilis Therapeutics
Lisa Frigati, Laurens Manning, Michael Marks, Oriol Mitjà, Thomas Fitzpatrick, Pingyu Zhou