Catalog CSV download: inconsistent results when filtering on planet_status
"other" is not equivalent to "controversial or unconfirmed or retracted"
This was discovered by Eleonora Alei while she was updating her code that uses our download pseudo-API:
Hello!
Both urls seem to be working fine and I can definitely download the catalog with no problem now. One small thing that I noticed is that, actually, the two urls don't give the same results as of now. Here is the planet_status counts of the catalog obtained with the shorter url (that uses Other, called "dat") and the longer command (that specifies the statuses, called "dat2")
dat.planet_status.value_counts()
Out[17]:
Confirmed 5538
Candidate 2755
Name: planet_status, dtype: int64
dat2.planet_status.value_counts()
Out[18]:
Confirmed 5538
Candidate 2755
Retracted 53
Controversial 48
Name: planet_status, dtype: int64
I can use the longer URL with no problems, I just thought you might want to know about this.
Thank you so very much for your help!
Eleonora Alei
It was a follow-up to @pmartin's message giving two theoretically equivalent URLs to download a filtered version of the catalog:
Hello again,
I should have anticipated the anti-phishing filtering... but here is the corrected url for you to test :
https://exoplanet.eu/catalog/csv/?query_f=planet_status%3D%22confirmed%22%20or%20planet_status%3D%22candidate%22%20or%20planet_status%3D%22unconfirmed%22%20or%20planet_status%3D%22controversial%22%20or%20planet_status%3D%22retracted%22
But as a side note, you can now use the planet_status "other", which is equivalent to "controversial or unconfirmed or retracted", giving you the shorter but equivalent URL:
https://exoplanet.eu/catalog/csv/?query_f=planet_status%3D%22confirmed%22%20or%20planet_status%3D%22candidate%22%20or%20planet_status%3D%22other%22
Have a nice day.
Pierre-Yves "PYM" Martin
exoplanet.eu lead dev.
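
To make the discrepancy easier to reproduce, here is a minimal sketch that rebuilds both URLs quoted above and compares planet_status counts. The helper names (build_url, status_counts, missing_from_short, fetch) are hypothetical and not part of any exoplanet.eu API. The comma delimiter and UTF-8 encoding of the downloaded CSV are assumptions. The counts used at the end are the ones reported by Eleonora, so the script runs without network access.

```python
import csv
import io
import urllib.request
from collections import Counter
from urllib.parse import quote

BASE_URL = "https://exoplanet.eu/catalog/csv/"


def build_url(statuses):
    """Build a download URL that filters on the given planet_status values."""
    query = " or ".join(f'planet_status="{s}"' for s in statuses)
    # Percent-encode everything, as in the URLs quoted in this issue.
    return f"{BASE_URL}?query_f={quote(query, safe='')}"


def status_counts(csv_text, delimiter=","):
    """Count rows per planet_status (assumption: comma-separated CSV)."""
    rows = csv.DictReader(io.StringIO(csv_text), delimiter=delimiter)
    return Counter(row["planet_status"] for row in rows)


def missing_from_short(short_counts, long_counts):
    """Return the statuses (with row counts) that the short URL fails to return."""
    return dict(Counter(long_counts) - Counter(short_counts))


def fetch(url):
    """Download a URL as text (assumption: UTF-8). Not called in this sketch."""
    with urllib.request.urlopen(url) as resp:
        return resp.read().decode("utf-8")


SHORT_URL = build_url(["confirmed", "candidate", "other"])
LONG_URL = build_url(
    ["confirmed", "candidate", "unconfirmed", "controversial", "retracted"]
)

# Counts copied from the report above, not fetched live.
reported_short = {"Confirmed": 5538, "Candidate": 2755}
reported_long = {
    "Confirmed": 5538,
    "Candidate": 2755,
    "Retracted": 53,
    "Controversial": 48,
}
print(missing_from_short(reported_short, reported_long))
```

Against the live service, the same comparison would be missing_from_short(status_counts(fetch(SHORT_URL)), status_counts(fetch(LONG_URL))). An empty result would mean the two filters agree.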