New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add "target" variable to ProfileReport and then add more graphs #73
Comments
How is this better than what's already there? If the column is categorical you'll get that histogram. |
countplot and histogram are different things, we may see different distributions in countplot |
I think what @eamag meant is that with a countplot you can compare side-by-side the distribution of different categorical fields, where the Y is the count. See https://seaborn.pydata.org/tutorial/categorical.html One way to implement this feature is to generate a countplot on each categorical field against every other one. I'm sure this will hurt the performance and won't give meaningful value to the user. In an exploratory process, usually, you need to choose rationally which categorical fields you want to compare (like the Titanic example in the above link). Creating plots by comparing all categorical fields, like A vs B, B vs C, C vs A (2-fields) or A vs B vs C (3-fields) will create an exponential amount of plots (because it is a combinatory analysis). In my opinion, we shouldn't implement this feature. Best |
Hello, I still keep it open it could be studied. |
I agree this would be a great feature
I do not understand your point, is it really heavier to plot instead of plotting ? |
Stale issue |
https://seaborn.pydata.org/generated/seaborn.countplot.html is very useful if target is categorical. so it'd be nice to plot it if necessary
The text was updated successfully, but these errors were encountered: