Data
The tool will visualize the position of celestial objects in a specific cordinate plane of the celestial sphere, allowing users to filter results based on redshift, telescope band (u, g, r i, z), and type of celestial object. The user will see the characteristics of the visualized points by hovering/clicking on them. With visual access, anyone studying these points in space will be able to more accurately and confidently make assumptions about the characteristics of certain areas of space and perform more in-depth research after using the tool.
The dataset was posted on Kaggle by Lennart Grosser, a machine learning engineer from Berlin, Germany. The dataset contains 10,000 rows of observations, each having 17 numerical columns and one class column and is a subset of the SDSS dataset released, that were taken by the Sloan Digital Sky Survey (SDSS). The SDSS provides us with the information needed to learn and discover more about our universe. This exact data table was creating by querying the CasJobs database, joining the photometric and spectral data tables published by the SDSS. The dataset we worked with can be found at
this link.
Since this data was collected from observations of our universe, there are not any apparent ethical issues that arise. When it comes to biases in the data, the subset of the SDSS dataset provided on Kaggle may not have been fully representative of the true distribution of observations across the universe. Since we do not know if the subset of data was chosen at random, or handpicked, we do not now how widely distributed the values are for the true scale, but should still be able to provide us with sufficient information.