Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FP8] rebase main #5963

Merged
merged 132 commits into from
Aug 6, 2024

Commits on Jun 24, 2024

  1. add SimPO

    YeAnbang committed Jun 24, 2024
    Configuration menu
    Copy the full SHA
    82aecd6 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    4b59d87 View commit details
    Browse the repository at this point in the history
  3. fix dataloader

    YeAnbang committed Jun 24, 2024
    Configuration menu
    Copy the full SHA
    0b2d627 View commit details
    Browse the repository at this point in the history
  4. remove debug code

    YeAnbang committed Jun 24, 2024
    Configuration menu
    Copy the full SHA
    f3de5a0 View commit details
    Browse the repository at this point in the history

Commits on Jun 27, 2024

  1. add orpo

    YeAnbang committed Jun 27, 2024
    Configuration menu
    Copy the full SHA
    c8d1b4a View commit details
    Browse the repository at this point in the history
  2. fix style

    YeAnbang committed Jun 27, 2024
    Configuration menu
    Copy the full SHA
    8aad064 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    384c640 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    afa5306 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    b117274 View commit details
    Browse the repository at this point in the history

Commits on Jun 28, 2024

  1. Configuration menu
    Copy the full SHA
    e752776 View commit details
    Browse the repository at this point in the history
  2. fix torch colossalai version

    YeAnbang committed Jun 28, 2024
    Configuration menu
    Copy the full SHA
    a8af6cc View commit details
    Browse the repository at this point in the history
  3. update transformers version

    YeAnbang committed Jun 28, 2024
    Configuration menu
    Copy the full SHA
    ff53520 View commit details
    Browse the repository at this point in the history

Commits on Jul 5, 2024

  1. [shardformer] DeepseekMoE support (hpcaitech#5871)

    * [Feature] deepseek moe expert parallel implement
    
    * [misc] fix typo, remove redundant file (hpcaitech#5867)
    
    * [misc] fix typo
    
    * [pre-commit.ci] auto fixes from pre-commit.com hooks
    
    for more information, see https://pre-commit.ci
    
    ---------
    
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
    
    * [Feature] deepseek support & unit test
    
    * [misc] remove debug code & useless print
    
    * [misc] fix typos (hpcaitech#5872)
    
    * [Feature] remove modeling file, use auto config. (hpcaitech#5884)
    
    * [misc] fix typos
    
    * [Feature] deepseek support via auto model, remove modeling file
    
    * [misc] delete useless file
    
    * [misc] fix typos
    
    * [Deepseek] remove redundant code (hpcaitech#5888)
    
    * [misc] fix typos
    
    * [Feature] deepseek support via auto model, remove modeling file
    
    * [misc] delete useless file
    
    * [misc] fix typos
    
    * [misc] remove redundant code
    
    * [Feature/deepseek] resolve comment. (hpcaitech#5889)
    
    * [misc] fix typos
    
    * [Feature] deepseek support via auto model, remove modeling file
    
    * [misc] delete useless file
    
    * [misc] fix typos
    
    * [misc] remove redundant code
    
    * [misc] mv module replacement into if branch
    
    * [misc] add some warning message and modify some code in unit test
    
    * [misc] fix typos
    
    ---------
    
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
    Hz188 and pre-commit-ci[bot] authored Jul 5, 2024
    Configuration menu
    Copy the full SHA
    3420921 View commit details
    Browse the repository at this point in the history
  2. [Hoxfix] Fix CUDA_DEVICE_MAX_CONNECTIONS for comm overlap

    Co-authored-by: Edenzzzz <wtan45@wisc.edu>
    Edenzzzz and Edenzzzz authored Jul 5, 2024
    Configuration menu
    Copy the full SHA
    8ec24b6 View commit details
    Browse the repository at this point in the history

Commits on Jul 8, 2024

  1. [Feat] Diffusion Model(PixArtAlpha/StableDiffusion3) Support (hpcaite…

    …ch#5838)
    
    * Diffusion Model Inference support
    
    * Stable Diffusion 3 Support
    
    * pixartalpha support
    LRY89757 authored Jul 8, 2024
    Configuration menu
    Copy the full SHA
    cba2052 View commit details
    Browse the repository at this point in the history
  2. [HotFix] CI,import,requirements-test for hpcaitech#5838 (hpcaitech#5892)

    * [Hot Fix] CI,import,requirements-test
    
    ---------
    
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
    LRY89757 and pre-commit-ci[bot] authored Jul 8, 2024
    Configuration menu
    Copy the full SHA
    66abf1c View commit details
    Browse the repository at this point in the history

Commits on Jul 9, 2024

  1. [Feature] Enable PP + SP for llama (hpcaitech#5868)

    * fix cross-PP-stage position id length diff bug
    
    * fix typo
    
    * fix typo
    
    * [pre-commit.ci] auto fixes from pre-commit.com hooks
    
    for more information, see https://pre-commit.ci
    
    * use a one cross entropy func for all shardformer models
    
    ---------
    
    Co-authored-by: Edenzzzz <wtan45@wisc.edu>
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
    3 people authored Jul 9, 2024
    Configuration menu
    Copy the full SHA
    fbf33ec View commit details
    Browse the repository at this point in the history

Commits on Jul 10, 2024

  1. Configuration menu
    Copy the full SHA
    16f3451 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    669849d View commit details
    Browse the repository at this point in the history
  3. add benchmark for sft, dpo, simpo, orpo. Add benchmarking result. Sup…

    …port lora with gradient checkpoint
    YeAnbang committed Jul 10, 2024
    Configuration menu
    Copy the full SHA
    d888c37 View commit details
    Browse the repository at this point in the history
  4. fix style

    YeAnbang committed Jul 10, 2024
    Configuration menu
    Copy the full SHA
    f6ef5c3 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    33f1520 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    8a9721b View commit details
    Browse the repository at this point in the history

Commits on Jul 11, 2024

  1. Merge pull request hpcaitech#5850 from hpcaitech/rlhf_SimPO

    [Chat] Rlhf support SimPO
    YeAnbang authored Jul 11, 2024
    Configuration menu
    Copy the full SHA
    dd9e1cd View commit details
    Browse the repository at this point in the history
  2. fix eval

    YeAnbang committed Jul 11, 2024
    Configuration menu
    Copy the full SHA
    e7a8634 View commit details
    Browse the repository at this point in the history
  3. hotfix citation

    YeAnbang committed Jul 11, 2024
    Configuration menu
    Copy the full SHA
    115c4cc View commit details
    Browse the repository at this point in the history
  4. [zero] support all-gather overlap (hpcaitech#5898)

    * [zero] support all-gather overlap
    
    * [zero] add overlap all-gather flag
    
    * [misc] fix typo
    
    * [zero] update api
    ver217 authored Jul 11, 2024
    Configuration menu
    Copy the full SHA
    c068ef0 View commit details
    Browse the repository at this point in the history

Commits on Jul 15, 2024

  1. fix orpo cross entropy loss

    YeAnbang committed Jul 15, 2024
    Configuration menu
    Copy the full SHA
    b3594d4 View commit details
    Browse the repository at this point in the history
  2. [Auto Parallel]: Speed up intra-op plan generation by 44% (hpcaitech#…

    …5446)
    
    * Remove unnecessary calls to deepcopy
    
    * Build DimSpec's difference dict only once
    
    This change considerably speeds up construction speed of DimSpec objects. The difference_dict is the same for each DimSpec object, so a single copy of it is enough.
    
    * Fix documentation of DimSpec's difference method
    stephankoe authored Jul 15, 2024
    Configuration menu
    Copy the full SHA
    45c49dd View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    1c961b2 View commit details
    Browse the repository at this point in the history

Commits on Jul 16, 2024

  1. Merge pull request hpcaitech#5901 from hpcaitech/colossalchat

    [Chat] fix eval: add in training evaluation, fix orpo sft loss bug
    YeAnbang authored Jul 16, 2024
    Configuration menu
    Copy the full SHA
    d8bf7e0 View commit details
    Browse the repository at this point in the history
  2. [compatibility] support torch 2.2 (hpcaitech#5875)

    * Support Pytorch 2.2.2
    
    * keep build_on_pr file and update .compatibility
    GuangyaoZhang authored and ver217 committed Jul 16, 2024
    Configuration menu
    Copy the full SHA
    2e28c79 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    530283d View commit details
    Browse the repository at this point in the history
  4. [misc] support torch2.3 (hpcaitech#5893)

    * [misc] support torch2.3
    
    * [devops] update compatibility ci
    
    * [devops] update compatibility ci
    
    * [devops] add debug
    
    * [devops] add debug
    
    * [devops] add debug
    
    * [devops] add debug
    
    * [devops] remove debug
    
    * [devops] remove debug
    ver217 committed Jul 16, 2024
    Configuration menu
    Copy the full SHA
    27a72f0 View commit details
    Browse the repository at this point in the history

Commits on Jul 17, 2024

  1. Configuration menu
    Copy the full SHA
    73494de View commit details
    Browse the repository at this point in the history

Commits on Jul 18, 2024

  1. [plugin] support all-gather overlap for hybrid parallel (hpcaitech#5919)

    * [plugin] fixed all-gather overlap support for hybrid parallel
    ver217 authored Jul 18, 2024
    Configuration menu
    Copy the full SHA
    e861279 View commit details
    Browse the repository at this point in the history
  2. add kto

    YeAnbang committed Jul 18, 2024
    Configuration menu
    Copy the full SHA
    09d5ffc View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    845ea72 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    544b7a3 View commit details
    Browse the repository at this point in the history

Commits on Jul 19, 2024

  1. [Examples] Add lazy init to OPT and GPT examples (hpcaitech#5924)

    Co-authored-by: Edenzzzz <wtan45@wisc.edu>
    Edenzzzz and Edenzzzz authored Jul 19, 2024
    Configuration menu
    Copy the full SHA
    8cc8f64 View commit details
    Browse the repository at this point in the history
  2. [ColossalChat] Hotfix for ColossalChat (hpcaitech#5910)

    * add ignore and tiny llama
    
    * fix path issue
    
    * run style
    
    * fix issue
    
    * update bash
    
    * add ignore and tiny llama
    
    * fix path issue
    
    * run style
    
    * fix issue
    
    * update bash
    
    * fix ddp issue
    
    * add Qwen 1.5 32B
    TongLi3701 authored Jul 19, 2024
    Configuration menu
    Copy the full SHA
    f585d4e View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    d08c99b View commit details
    Browse the repository at this point in the history
  4. refactor tokenization

    YeAnbang committed Jul 19, 2024
    Configuration menu
    Copy the full SHA
    d49550f View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    150505c View commit details
    Browse the repository at this point in the history

Commits on Jul 21, 2024

  1. [FIX BUG] UnboundLocalError: cannot access local variable 'default_co…

    …nversation' where it is not associated with a value (hpcaitech#5931)
    
    * cannot access local variable 'default_conversation' where it is not associated with a value
    
    set default value for 'default_conversation'
    
    * [pre-commit.ci] auto fixes from pre-commit.com hooks
    
    for more information, see https://pre-commit.ci
    
    ---------
    
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
    zhurunhua and pre-commit-ci[bot] authored Jul 21, 2024
    Configuration menu
    Copy the full SHA
    4ec17a7 View commit details
    Browse the repository at this point in the history

Commits on Jul 22, 2024

  1. fix test data

    YeAnbang committed Jul 22, 2024
    Configuration menu
    Copy the full SHA
    c5f582f View commit details
    Browse the repository at this point in the history
  2. refactor evaluation

    YeAnbang committed Jul 22, 2024
    Configuration menu
    Copy the full SHA
    12fe8b5 View commit details
    Browse the repository at this point in the history
  3. remove real data path

    YeAnbang committed Jul 22, 2024
    Configuration menu
    Copy the full SHA
    b0e15d5 View commit details
    Browse the repository at this point in the history
  4. remove real data path

    YeAnbang committed Jul 22, 2024
    Configuration menu
    Copy the full SHA
    9688e19 View commit details
    Browse the repository at this point in the history

Commits on Jul 23, 2024

  1. Configuration menu
    Copy the full SHA
    a521ffc View commit details
    Browse the repository at this point in the history

Commits on Jul 24, 2024

  1. Configuration menu
    Copy the full SHA
    5fb958c View commit details
    Browse the repository at this point in the history

Commits on Jul 25, 2024

  1. [Hotfix] Fix ZeRO typo hpcaitech#5936

    Co-authored-by: Edenzzzz <wtan45@wisc.edu>
    Edenzzzz and Edenzzzz authored Jul 25, 2024
    Configuration menu
    Copy the full SHA
    2069472 View commit details
    Browse the repository at this point in the history

Commits on Jul 26, 2024

  1. [Feature] Add a switch to control whether the model checkpoint needs …

    …to be saved after each epoch ends (hpcaitech#5941)
    
    * Add a switch to control whether the model checkpoint needs to be saved after each epoch ends
    
    * [pre-commit.ci] auto fixes from pre-commit.com hooks
    
    for more information, see https://pre-commit.ci
    
    ---------
    
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
    zhurunhua and pre-commit-ci[bot] authored Jul 26, 2024
    Configuration menu
    Copy the full SHA
    ad35a98 View commit details
    Browse the repository at this point in the history
  2. fix style

    YeAnbang committed Jul 26, 2024
    Configuration menu
    Copy the full SHA
    8a3ff4f View commit details
    Browse the repository at this point in the history
  3. fix style

    YeAnbang committed Jul 26, 2024
    Configuration menu
    Copy the full SHA
    de1bf08 View commit details
    Browse the repository at this point in the history

Commits on Jul 29, 2024

  1. fix style

    YeAnbang committed Jul 29, 2024
    Configuration menu
    Copy the full SHA
    6fd9e86 View commit details
    Browse the repository at this point in the history
  2. Merge pull request hpcaitech#5922 from hpcaitech/kto

    [Chat] Add KTO
    YeAnbang authored Jul 29, 2024
    Configuration menu
    Copy the full SHA
    c8332b9 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    9664b1b View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    7b38964 View commit details
    Browse the repository at this point in the history

Commits on Jul 30, 2024

  1. [Feat] Distrifusion Acceleration Support for Diffusion Inference (hpc…

    …aitech#5895)
    
    * Distrifusion Support source
    
    * comp comm overlap optimization
    
    * sd3 benchmark
    
    * pixart distrifusion bug fix
    
    * sd3 bug fix and benchmark
    
    * generation bug fix
    
    * naming fix
    
    * add docstring, fix counter and shape error
    
    * add reference
    
    * readme and requirement
    LRY89757 authored Jul 30, 2024
    Configuration menu
    Copy the full SHA
    bcf0181 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    0608921 View commit details
    Browse the repository at this point in the history

Commits on Jul 31, 2024

  1. Configuration menu
    Copy the full SHA
    09c5f72 View commit details
    Browse the repository at this point in the history
  2. [Chat] Fix lora (hpcaitech#5946)

    * fix merging
    
    * remove filepath
    
    * fix style
    YeAnbang authored Jul 31, 2024
    Configuration menu
    Copy the full SHA
    30f4e31 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    66fbf2e View commit details
    Browse the repository at this point in the history
  4. [hotfix] Remove unused plan section (hpcaitech#5957)

    * remove readme
    
    * fix readme
    
    * update
    TongLi3701 authored Jul 31, 2024
    Configuration menu
    Copy the full SHA
    1aeb5e8 View commit details
    Browse the repository at this point in the history

Commits on Aug 1, 2024

  1. Configuration menu
    Copy the full SHA
    f9b6fcf View commit details
    Browse the repository at this point in the history
  2. [test] add mixtral transformer test

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    0b76b57 View commit details
    Browse the repository at this point in the history
  3. [moe] fix plugin

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    8ae8525 View commit details
    Browse the repository at this point in the history
  4. [test] mixtra pp shard test

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    a249e71 View commit details
    Browse the repository at this point in the history
  5. [chore] handle non member group

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    0fad23c View commit details
    Browse the repository at this point in the history
  6. [zero] solve hang

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    46c069b View commit details
    Browse the repository at this point in the history
  7. [test] pass mixtral shardformer test

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    37443cc View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    b5bfeb2 View commit details
    Browse the repository at this point in the history
  9. [zero] solve hang

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    13b48ac View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    fe24789 View commit details
    Browse the repository at this point in the history
  11. solve hang when parallel mode = pp + dp

    Hz188 authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    5ed5e8c View commit details
    Browse the repository at this point in the history
  12. [moe] implement submesh initialization

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    e28e053 View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    9b9b76b View commit details
    Browse the repository at this point in the history
  14. [chore] manually revert unintended commit

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    014faf6 View commit details
    Browse the repository at this point in the history
  15. [chore] trivial fix

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    8dbb868 View commit details
    Browse the repository at this point in the history
  16. [chore] arg pass & remove drop token

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    102b784 View commit details
    Browse the repository at this point in the history
  17. [test] add mixtral modelling test

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    0b5bbe9 View commit details
    Browse the repository at this point in the history
  18. [moe] implement tp

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    dc583aa View commit details
    Browse the repository at this point in the history
  19. [moe] test deepseek

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    74eccac View commit details
    Browse the repository at this point in the history
  20. [moe] clean legacy code

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    3e2b613 View commit details
    Browse the repository at this point in the history
  21. [Feature] MoE Ulysses Support (hpcaitech#5918)

    * moe sp support
    
    * moe sp bug solve
    
    * [pre-commit.ci] auto fixes from pre-commit.com hooks
    
    for more information, see https://pre-commit.ci
    
    ---------
    
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
    2 people authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    404b16f View commit details
    Browse the repository at this point in the history
  22. [chore] minor fix

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    09d6280 View commit details
    Browse the repository at this point in the history
  23. [moe] init moe plugin comm setting with sp

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    877d94b View commit details
    Browse the repository at this point in the history
  24. moe sp + ep bug fix

    Hz188 authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    2cddeac View commit details
    Browse the repository at this point in the history
  25. [moe] finalize test (no pp)

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    7077d38 View commit details
    Browse the repository at this point in the history
  26. Configuration menu
    Copy the full SHA
    803878b View commit details
    Browse the repository at this point in the history
  27. [chore] minor fix after rebase

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    46037c2 View commit details
    Browse the repository at this point in the history
  28. [pre-commit.ci] auto fixes from pre-commit.com hooks

    for more information, see https://pre-commit.ci
    pre-commit-ci[bot] authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    52d346f View commit details
    Browse the repository at this point in the history
  29. Configuration menu
    Copy the full SHA
    70c9924 View commit details
    Browse the repository at this point in the history
  30. [moe] remove ops

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    74b03de View commit details
    Browse the repository at this point in the history
  31. [test] fix test: test_zero1_2

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    067e18f View commit details
    Browse the repository at this point in the history
  32. Configuration menu
    Copy the full SHA
    96d0fbc View commit details
    Browse the repository at this point in the history
  33. [moe] deepseek moe sp support

    Hz188 authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    b2952a5 View commit details
    Browse the repository at this point in the history
  34. [test] add check

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    6c39f0b View commit details
    Browse the repository at this point in the history
  35. Configuration menu
    Copy the full SHA
    c3dc9b4 View commit details
    Browse the repository at this point in the history
  36. [misc] skip redunant test

    Hz188 authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    59bcf56 View commit details
    Browse the repository at this point in the history
  37. [misc] remove debug/print code

    Hz188 authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    034020b View commit details
    Browse the repository at this point in the history
  38. [moe] refactor mesh assignment

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    cb01c0d View commit details
    Browse the repository at this point in the history
  39. Revert "[moe] implement submesh initialization"

    This reverts commit 2f9bce6.
    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    5b4c123 View commit details
    Browse the repository at this point in the history
  40. [chore] change moe_pg_mesh to private

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    606b089 View commit details
    Browse the repository at this point in the history
  41. [misc] remove incompatible test config

    Hz188 authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    12d043c View commit details
    Browse the repository at this point in the history
  42. Configuration menu
    Copy the full SHA
    70793ce View commit details
    Browse the repository at this point in the history
  43. [misc] remove useless condition

    Hz188 authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    7e737df View commit details
    Browse the repository at this point in the history
  44. [chore] docstring

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    f7c5485 View commit details
    Browse the repository at this point in the history
  45. Configuration menu
    Copy the full SHA
    7bedd03 View commit details
    Browse the repository at this point in the history
  46. Configuration menu
    Copy the full SHA
    65daa87 View commit details
    Browse the repository at this point in the history
  47. [moe] solve dp axis issue

    botbw authored and ver217 committed Aug 1, 2024
    Configuration menu
    Copy the full SHA
    d1d1ab8 View commit details
    Browse the repository at this point in the history
  48. Configuration menu
    Copy the full SHA
    62cdac6 View commit details
    Browse the repository at this point in the history

Commits on Aug 2, 2024

  1. [feat] Dist Loader for Eval (hpcaitech#5950)

    * support auto distributed data loader
    
    * [pre-commit.ci] auto fixes from pre-commit.com hooks
    
    for more information, see https://pre-commit.ci
    
    * support auto distributed data loader
    
    * [pre-commit.ci] auto fixes from pre-commit.com hooks
    
    for more information, see https://pre-commit.ci
    
    * fix tp error
    
    * remove unused parameters
    
    * remove unused
    
    * update inference
    
    * update docs
    
    * update inference
    
    ---------
    
    Co-authored-by: Michelle <qianranma8@gmail.com>
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
    3 people authored Aug 2, 2024
    Configuration menu
    Copy the full SHA
    19d1510 View commit details
    Browse the repository at this point in the history
  2. [lora] lora support hybrid parallel plugin (hpcaitech#5956)

    * lora support hybrid plugin
    
    * fix
    
    * fix
    
    * fix
    
    * fix
    wangbluo authored Aug 2, 2024
    Configuration menu
    Copy the full SHA
    75c9636 View commit details
    Browse the repository at this point in the history
  3. fp8 operators for compressed communication

    cast_to_fp8, cast_from_fp8, all_reduce_fp8
    BurkeHulk authored and flybird11111 committed Aug 2, 2024
    Configuration menu
    Copy the full SHA
    82c8475 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    99a9bf3 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    7052579 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    c30a24e View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    3b1c861 View commit details
    Browse the repository at this point in the history
  8. fix typo

    GuangyaoZhang authored and flybird11111 committed Aug 2, 2024
    Configuration menu
    Copy the full SHA
    fb9486c View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    e60fcdd View commit details
    Browse the repository at this point in the history
  10. shardformer fp8

    GuangyaoZhang authored and flybird11111 committed Aug 2, 2024
    Configuration menu
    Copy the full SHA
    778513e View commit details
    Browse the repository at this point in the history
  11. fix rebase

    GuangyaoZhang authored and flybird11111 committed Aug 2, 2024
    Configuration menu
    Copy the full SHA
    259e696 View commit details
    Browse the repository at this point in the history
  12. remove all to all

    GuangyaoZhang authored and flybird11111 committed Aug 2, 2024
    Configuration menu
    Copy the full SHA
    f7c7273 View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    afe4200 View commit details
    Browse the repository at this point in the history
  14. Configuration menu
    Copy the full SHA
    1e7293f View commit details
    Browse the repository at this point in the history
  15. Configuration menu
    Copy the full SHA
    047feb9 View commit details
    Browse the repository at this point in the history

Commits on Aug 5, 2024

  1. Configuration menu
    Copy the full SHA
    811d5af View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    0e6e488 View commit details
    Browse the repository at this point in the history
  3. fix

    flybird11111 committed Aug 5, 2024
    Configuration menu
    Copy the full SHA
    7e0c777 View commit details
    Browse the repository at this point in the history

Commits on Aug 6, 2024

  1. Configuration menu
    Copy the full SHA
    6f29436 View commit details
    Browse the repository at this point in the history