peerannot

–Handling your crowdsourced datasets easily–

Requires Python 3.8+. Available on PyPI.

The peerannot library was created to handle crowdsourced labels in classification problems.

Getting started

Start here to get up and running

Tutorials and additional examples

Want to dive deeper into the library? Check out the tutorials. You will find resources to add your own datasets and strategies, and to run your first label aggregations.

API and CLI Reference

Need more detail than the tutorials provide? The full API and CLI references are listed below.

Run peerannot from a python script

API Reference

Run peerannot from your terminal

CLI Reference

Glossary

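| Name | Definition | Mathematical Definition |
|------|------------|-------------------------|
| $n_{\text{task}}$ | The total number of tasks in a dataset | |
| $n_{\text{worker}}$ | The total number of workers in a dataset | |
| $[K]$ | The set of labels a task can take | $[K]=\{1,\dots,K\}$ |
| $\Delta_K$ | The simplex of dimension $K-1$, used to represent soft labels (i.e., labels as a probability vector over $[K]$) | $\Delta_K=\{p\in\mathbb{R}^K : \sum_{k=1}^{K} p_k=1,\ p_k\geq 0\}$ |
| $\mathcal{A}(x_i)$ | The set of workers that answered task $i$ | $\{j\in[n_{\text{worker}}] : w_j \text{ answered } x_i\}$ |
| $\mathcal{T}(w_j)$ | The set of tasks answered by worker $j$ | $\{i\in[n_{\text{task}}] : w_j \text{ answered } x_i\}$ |
| $\mathrm{Lab}(x_i)$ | The vector of labels answered for task $i$ | $(y_i^{(j)})_{j\in\mathcal{A}(x_i)}$ |
| $y_i$ | The true label of task $i$ | $y_i\in[K]$ |
| $\hat{y}_i^{\text{agg}}$ | The label of task $i$ computed by the aggregation method agg | $\begin{cases}\hat{y}_i^{\text{agg}}\in[K] & \text{if a hard label}\\ \hat{y}_i^{\text{agg}}\in\Delta_K & \text{if a soft label}\end{cases}$ |
| $y_i^{(j)}$ | The (hard) label that worker $j$ assigned to task $i$ | |
| $\pi^{(j)}$ | The confusion matrix of worker $j$ | $\pi^{(j)}_{k,\ell}=\mathbb{P}(y_i^{(j)}=\ell \mid y_i=k),\ (\ell,k)\in[K]^2$ |
| $\mathrm{AccTrain}(\mathcal{D})$ | A metric that measures the accuracy of aggregation strategies | $\mathrm{AccTrain}(\mathcal{D})=\frac{1}{\lvert\mathcal{D}\rvert}\sum_{i=1}^{\lvert\mathcal{D}\rvert}\mathbf{1}\{y_i=\operatorname{argmax}_{k\in[K]}(\hat{y}_i)_k\}$ |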
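To make the glossary notation concrete, here is a minimal pure-Python sketch on a synthetic toy dataset. It does not use the peerannot API: the `answers` layout, the helper names (`workers_of`, `tasks_of`, `soft_majority_vote`, `acc_train`, `confusion_matrix`), and the normalized-vote aggregation are all assumptions chosen for illustration. Labels are indexed from 0 to K-1 in the code, whereas the glossary writes [K]={1,...,K}.

```python
from collections import Counter

# Hypothetical toy data (not a peerannot format guarantee):
# answers[i][j] is the hard label y_i^(j) that worker j gave to task i.
K = 3
answers = {
    0: {0: 2, 1: 2, 2: 0},
    1: {0: 1, 2: 1},
    2: {1: 0, 2: 2, 3: 0},
}
# True labels y_i, available here only because the data is synthetic.
true_labels = {0: 2, 1: 1, 2: 2}


def workers_of(task):
    """A(x_i): the set of workers that answered task i."""
    return set(answers[task])


def tasks_of(worker):
    """T(w_j): the set of tasks answered by worker j."""
    return {i for i, votes in answers.items() if worker in votes}


def soft_majority_vote(task):
    """A soft label in the simplex: normalized vote counts for task i."""
    counts = Counter(answers[task].values())
    total = sum(counts.values())
    return [counts.get(k, 0) / total for k in range(K)]


def acc_train(soft_labels):
    """AccTrain: fraction of tasks whose argmax label equals the true label."""
    hits = 0
    for i, p in soft_labels.items():
        predicted = max(range(K), key=p.__getitem__)  # first index wins ties
        hits += int(predicted == true_labels[i])
    return hits / len(soft_labels)


def confusion_matrix(worker):
    """Empirical pi^(j): entry [k][l] estimates P(y_i^(j) = l | y_i = k)."""
    counts = [[0] * K for _ in range(K)]
    for i in tasks_of(worker):
        counts[true_labels[i]][answers[i][worker]] += 1
    # Rows with no observation are left at zero rather than normalized.
    return [[c / sum(row) if sum(row) else 0.0 for c in row] for row in counts]


y_hat = {i: soft_majority_vote(i) for i in answers}
print(acc_train(y_hat))
print(confusion_matrix(2))
```

In this toy example the aggregated soft label of each task sums to 1, so it lies in the simplex, and the accuracy is computed against the known true labels. On real crowdsourced data the true labels are usually unavailable for most tasks, so a metric of this kind can only be evaluated where ground truth exists.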

Citation

Cite us, join us, and let us collaboratively improve our toolbox!

@article{lefort2024,
   author = {Lefort, Tanguy and Charlier, Benjamin and Joly, Alexis and Salmon, Joseph},
   publisher = {French Statistical Society},
   title = {Peerannot: Classification for Crowdsourced Image Datasets with {Python}},
   journal = {Computo},
   date = {2024-04-04},
   url = {https://computo.sfds.asso.fr/published-202402-lefort-peerannot/},
}