ECLSK Data produces NaNs and/or no correlation in HMA #2176

Open
awesomeisfree opened this issue Aug 8, 2024 · 0 comments
Labels: new (automatic label applied to new issues), question (general question about the software)

Environment details

If you are already running SDV, please indicate the following details about the environment in
which you are running it:

  • SDV version: 1.15
  • Python version: 3.10.12
  • Operating System: Windows 10 (Google Colab)

Problem description

When I fit and sample from the public ECLSK dataset (attached, and available here: https://nces.ed.gov/ecls/), several strange outcomes occur. Most notably, the synthetic OUTCOME column either comes out as a single repeated value or is filled with NaNs. Nothing about that column appears unusual. Please advise.

What I already tried

Adjusting column dtypes and reducing the dataset to fewer columns.
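
A further check that may help narrow this down is to compare how the OUTCOME column looks in the real table and in the sampled output. The following is a minimal sketch using only the Python standard library. The helper names `is_missing` and `summarize_column`, the list of missing-value tokens, and the example rows are all assumptions made for illustration; none of them come from SDV or from the ECLSK files.

```python
import math
from collections import Counter

# Strings treated as missing values (an assumption; adjust this set to match
# how the real CSV files encode missing data).
MISSING_TOKENS = {"", "nan", "na", "null", "none"}


def is_missing(value):
    """Return True if a raw cell value should be treated as missing."""
    if value is None:
        return True
    if isinstance(value, float) and math.isnan(value):
        return True
    return str(value).strip().lower() in MISSING_TOKENS


def summarize_column(rows, column):
    """Summarize one column of an iterable of dict rows (e.g. csv.DictReader)."""
    total = 0
    missing = 0
    counts = Counter()
    for row in rows:
        total += 1
        value = row.get(column)
        if is_missing(value):
            missing += 1
        else:
            counts[str(value).strip()] += 1
    return {
        "rows": total,
        "missing": missing,
        "distinct": len(counts),
        # True when the column holds at most one distinct non-missing value.
        "is_constant": len(counts) <= 1,
        "top_values": counts.most_common(5),
    }


# Made-up rows for illustration only; these are NOT values from the ECLSK files.
example_rows = [
    {"id": "1", "OUTCOME": "1"},
    {"id": "2", "OUTCOME": ""},
    {"id": "3", "OUTCOME": "1"},
    {"id": "4", "OUTCOME": "0"},
]
summary = summarize_column(example_rows, "OUTCOME")
print(summary)
```

Running the same summary on the real table (for example via `csv.DictReader` over the attached CSV) and on the synthetic table would show whether OUTCOME is already constant or mostly missing before fitting, or whether the problem only appears after sampling.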

Link to Colab:
https://colab.research.google.com/drive/1pT81wxCReMNxam3ZP-6u3IM74R_0Czgh#scrollTo=YN16L5Ywcbou

children (1).csv
ECLSKdata (1).csv
schools (1).csv

awesomeisfree added the new and question labels on Aug 8, 2024
npatki self-assigned this on Aug 12, 2024
srinify assigned srinify and unassigned npatki on Sep 17, 2024

3 participants