Minor issues in math.evaluation

I found some minor issues in math.evaluation:
- The Wikipedia article linked in the r_precision [docstring](https://github.com/jina-ai/docarray/blob/944b91b3f20301ceb5c79690122e9bf292286e3f/docarray/math/evaluation.py#L14-L28) does not seems to be correct. According to the article the metric is calculated based on the Top-N results, where N is determined by the number of relevant documents in the dataset. However, the implementation sets N as the last position of a relevant document as done [here](https://gist.github.com/bwhite/3726239#file-rank_metrics-py-L40-L66). However, all descriptions of R-Precision which I found explain the metric like in the Wikipedia article. So it should be changed to behave like this
- The ndcg metric calculates the dcg score relative to the optimal dcg score. Basically, the implemention is correct, however, in most cases it is wrong to apply it in docarray.
The problem is, that optimal value needs to be calculated based on all possible results, however if I apply docarrays match function only the top-k ranking is returned. Therefore, the optimal top-value calculated by our implementation is wrong, e.g., if none of the top-20 results is relevant, the maximum score is 0, however, the actual maximum score might be higher because there relevant scores outside of the top-20. This should be at least explained in the documentation
- The R-Precision also requires to know the whole ranking. So this should also be noted in the documentation.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Minor issues in math.evaluation #618

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Minor issues in math.evaluation #618

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions