-
-
Notifications
You must be signed in to change notification settings - Fork 25.9k
[MRG] Fix Improvement in RANSAC: is_data_valid should receive subset_idxs #19498
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
This solution is buggy because your subset of indices is based on the One possibility is to propagate However, I don't understand why these samples should be included in the original |
Please check my comment in the issue. |
So in the end, it is not that you want to reject samples but you want to enforce some constraints on the resampling. I would think that we could add a new parameter (e.g. |
Yeap, now you got it. choose random sample setsubset_idxs = sample_without_replacement(n_samples, min_samples, random_state=random_state) |
The grouping sampling could be an option and we could also accept some callables. |
That sounds nice. If you think that this interface is better then lets do that. |
The contract of scikit-learn estimator is that at |
I think that this is not similar. I would personally go for an extra attribute in the init of the class that would define sampling strategy. |
For sure, you will need to add the parameter in Basically, what would be interesting is to know what are the other type of sampling that one can need. |
Reference Issues/PRs
Fixes #19495
What does this implement/fix? Explain your changes.
Any other comments?