
Use _py-xgboost-mutex to differentiate libxgboost CPU vs GPU #23

Closed
ksangeek opened this issue Mar 18, 2019 · 23 comments

@ksangeek (Contributor) commented Mar 18, 2019

I see that the libxgboost conda package also provides the xgboost CLI, but I don't see a way for a user to select the CPU or GPU variant of it.
Should there be meta-packages named libxgboost-cpu and libxgboost-gpu to help users selectively install the variant they want?

Also, should libxgboost depend on the selection package _py-xgboost-mutex so that the CPU and GPU variants cannot be installed together by mistake?

@jjhelmus Pulling you in here for comments, based on your comments in #14
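For reference, the mutex pattern under discussion looks roughly like this. This is a hypothetical sketch, not the actual recipe: the mutex is a tiny, near-empty package built in two variants whose version/build-string pairs can never be satisfied simultaneously, so the solver keeps at most one in an environment.

```yaml
# Hypothetical sketch of a mutex recipe (illustrative only):
# the CPU and GPU builds publish incompatible version/build-string pairs.
package:
  name: _py-xgboost-mutex
  version: "2.0"       # CPU variant; the GPU variant would use e.g. "1.0"

build:
  number: 0
  string: cpu_0        # the GPU variant would use e.g. gpu_0
```

Front-end packages then pin the matching variant (e.g. `_py-xgboost-mutex =2.0=cpu_0`), so installing a GPU front end forces the CPU mutex, and everything pinned to it, out of the environment.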

@beckermr (Member)

Ack. I thought we had this right.

@beckermr (Member)

So conda does not ship the GPU variant.

We do depend on the mutex package to make sure the env is consistent. https://github.com/conda-forge/xgboost-feedstock/blob/master/recipe/meta.yaml#L70

Do you have an example of how this is going wrong?

@beckermr (Member)

OK. So we cannot do this because the R package also depends on libxgboost. I am confused as to how you got an env into this state. You should not install libxgboost directly; instead, install one of the front-end packages, which carry the mutex dependencies.

@ksangeek (Contributor, Author) commented Mar 19, 2019

I see that a user could install libxgboost directly, and I don't see anything to restrict them from doing that, e.g. https://anaconda.org/conda-forge/libxgboost. I think you mean that it is not common for a user to install just the libxgboost conda package? Or am I missing something here?
My concern was only for the case where someone installs libxgboost directly (because they just want the xgboost CLI).

@beckermr (Member)

Installing libxgboost directly is not how the package is supposed to be used, but yes, of course people can do what they please.

I wonder if we should separate the CLI from the lib package? Then the mutex could work as intended.

@ksangeek (Contributor, Author)

Yes, separating the CLI from the libxgboost conda package would take care of the concern I brought up. Based on the discussion here, it seems nobody is expected to install just libxgboost, so it is not a major concern anyway!
Thank you @beckermr for spending time on this.

@beckermr (Member)

Happy to chat! I am going to leave this issue open for now.

@jakirkham (Member)

Would it make sense to have a single mutex for all of these packages as they will all be dependent on libxgboost for CPU or GPU support?

@beckermr (Member) commented May 2, 2019

When you say "single mutex," what do you mean?

@jakirkham (Member)

Sorry, I mean a single package to select GPU or CPU builds. Right now I see _py-xgboost-mutex and _r-xgboost-mutex. I am wondering if we can just have _libxgboost-mutex, which aligns C++, Python, and R. Does that make more sense?

@beckermr (Member) commented May 2, 2019

Ah ok. Does having a CPU python version installed in the env mean the GPU one cannot be? I don't know the internals well enough to understand the constraints here.

@jakirkham (Member)

My guess is that all packages using libxgboost need to be either CPU or GPU. That said, I don't know whether the CPU/GPU differences extend outside of libxgboost or are contained behind a common public API at the C++ level. If it's the latter, the Python and R bindings may be identical whether they are built to target CPU or GPU, as only libxgboost would be affected. @RAMitchell, could you please give us some guidance here?

@RAMitchell

None of the Python code is different; a binary compiled for GPU use can safely be used on a system without a GPU. It will only fail if a user calls a GPU function and the CUDA runtime does not exist.
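To illustrate the failure mode described above, here is a small Python sketch (the helper name is mine, not part of xgboost) of how one might check whether the CUDA runtime is even present before requesting GPU functionality from a GPU-enabled build:

```python
import ctypes.util


def cuda_runtime_present() -> bool:
    """Best-effort check: can a CUDA runtime library be found on this system?"""
    # find_library searches the standard loader paths; "cudart" is the CUDA
    # runtime that a GPU-enabled libxgboost loads when a GPU method is called.
    return ctypes.util.find_library("cudart") is not None


# A GPU build of xgboost imports fine either way; only GPU calls would fail:
print("CUDA runtime available:", cuda_runtime_present())
```

On a machine without CUDA this returns False, which is exactly the situation where a GPU-enabled binary still works for CPU training but would error on GPU calls.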

@beckermr (Member) commented May 3, 2019

So we can just compile in GPU support for everything? Why did we have the mutex in the first place? I copied it from the main anaconda channels.

@jakirkham (Member) commented May 3, 2019

Do you happen to remember the author of these changes (to the recipe)? Maybe we can ask them?

@beckermr (Member) commented May 3, 2019

looks like it was @jjhelmus and/or @mingwandroid

@ksangeek (Contributor, Author) commented May 3, 2019

There are a few things I'd like to add here -

  1. From a conda packaging perspective, the GPU variant would need to depend on cudatoolkit for build and run as used here -
    https://github.com/AnacondaRecipes/xgboost-feedstock/blob/885f8b2583def2146d494254d04ce05456704d3b/recipe/meta.yaml#L47
    But the CPU variant does not need that!

  2. I have seen the objects built for the libxgboost GPU variant (which include the compiled CUDA code) make the library very bulky. So, regardless of point 1, I think it is still a good idea to have separate conda packages for the GPU and CPU variants.
    Ref -
    https://github.com/AnacondaRecipes/xgboost-feedstock/blob/885f8b2583def2146d494254d04ce05456704d3b/recipe/build.sh#L16-L19

When I opened this issue, the idea was very close to what @jakirkham brought up here with respect to R. But my request was to include the dependency {{ pin_subpackage('_py-xgboost-mutex', exact=True) }} in the run/host section of the libxgboost package, so that the CPU and GPU versions are mutually exclusive. Installing the GPU variant on a system where the CPU variant is already installed should then prompt to remove the conflicting packages (including libxgboost). In its current form, the recipe will not prompt to remove libxgboost.

outputs:
  - name: libxgboost
    script: install-libxgboost.sh
    requirements:
      build:
        - {{ compiler('c') }}
        - {{ compiler('cxx') }}
        - git
        - make
        - llvm-openmp >=4.0.1  # [osx]
      host:
        - llvm-openmp >=4.0.1  # [osx]
      run:
        - llvm-openmp >=4.0.1  # [osx]
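The change I am requesting would look roughly like this (a sketch only, assuming _py-xgboost-mutex is an output of the same recipe so that `pin_subpackage` can resolve it):

```yaml
# Sketch of the proposed addition (not the current recipe):
outputs:
  - name: libxgboost
    requirements:
      run:
        - llvm-openmp >=4.0.1  # [osx]
        # proposed: make CPU and GPU libxgboost builds conflict directly
        - "{{ pin_subpackage('_py-xgboost-mutex', exact=True) }}"
```

With that pin in place, the solver would refuse to keep a CPU libxgboost alongside a GPU mutex (or vice versa) and would propose removing the conflicting packages.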

And as @jakirkham brought up I am supportive of the idea to have a single mutex _xgboost-mutex, which aligns C++, Python, and R.

@mingwandroid (Contributor)

If the GPU xgboost works just as well without GPUs as the non-GPU xgboost, then there's no compelling reason for the CPU variant to exist.

Shall we discard the idea of mutexes altogether, then?

@jakirkham (Member)

Personally, I would be fine with either keeping one mutex or dropping them.

It sounds to me like @ksangeek's concerns are basically about total install size, which seems like a valid concern (note: this comes up in other contexts too). Do we have a sense of what the size difference is now?

@jakirkham (Member)

Have added PR ( #35 ) to try to simplify CPU/GPU selection based on the discussion here. Please take a look and share your thoughts. 🙂

@jakirkham (Member)

We've since added GPU/CPU packages here. There may very well still be work needed, but please try them out and let us know.
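For anyone landing on this thread later, variant selection now works through the metapackages; a minimal environment file might look like this (a sketch, CPU variant shown; swap in the GPU metapackage as needed):

```yaml
# environment.yml sketch: pick one of the variant metapackages
name: xgboost-cpu-env
channels:
  - conda-forge
dependencies:
  - py-xgboost-cpu   # or the GPU metapackage, e.g. py-xgboost-gpu
```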

jakirkham added a commit to jakirkham-feedstocks/xgboost-feedstock that referenced this issue on Aug 5, 2023 (…tream_3): Merge `conda-forge/main` into `rapidsai/main` (pt. 3)
@hcho3 (Contributor) commented Jul 31, 2024

@jakirkham Can we close this out now?

@jakirkham (Member)

Yes, this is done.

If we want to make other changes, we should start a new issue to discuss them.
