Non-matching results in the DDR dataset #1

Open
CristianoPatricio opened this issue Nov 25, 2024 · 1 comment

Hi there,

Congratulations on this awesome work.
I'm trying to reproduce the results in Tables II and III of the paper, specifically those for the DDR dataset. After running all 10 folds with the default parameters from the config files, my results deviate slightly from those reported in the paper. I expected them to match the "CLAT(Ours)" row of both tables in the DDR-subset column. Is there any other configuration for the DDR dataset?

Thanks in advance.

Cristiano

Sorades (Owner) commented Nov 25, 2024

Thank you for your interest in our work and for taking the time to reproduce our results!

There is no other specific configuration for the DDR dataset. I suspect the deviations are due to training-time intervention, a trick borrowed from CEM that helps with test-time intervention. In our experiments, we observed that this strategy can occasionally affect model performance. You can try setting training_int_prob to 0 to disable it and see whether that brings your results in line with the reported ones.
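
For context, here is a rough sketch of what training-time intervention generally does, as I understand the idea from CEM. This is not the code from this repository: the function name `apply_training_intervention`, the list-based inputs, and the per-concept sampling are assumptions made for illustration. Only the parameter name training_int_prob comes from the config.

```python
import random


def apply_training_intervention(pred_concepts, true_concepts, training_int_prob, rng=None):
    """Randomly swap predicted concepts for ground-truth ones during training.

    Hypothetical helper: each concept is replaced independently with
    probability `training_int_prob`. The real implementation may sample
    per batch or per concept group instead.
    """
    rng = rng or random.Random()
    return [
        true if rng.random() < training_int_prob else pred
        for pred, true in zip(pred_concepts, true_concepts)
    ]


# With probability 0 the predictions pass through untouched,
# which is what "disabling" the trick amounts to in this sketch.
preds = [0.2, 0.9, 0.4]
truth = [1.0, 0.0, 1.0]
unchanged = apply_training_intervention(preds, truth, training_int_prob=0.0)
```

Because the replacement is random, runs with a non-zero probability can differ from one another even with otherwise identical settings, so fixing the seed is worth checking too when comparing against reported numbers.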
