Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

run demo report error #6

Open
gpsherry opened this issue Nov 11, 2020 · 9 comments
Open

run demo report error #6

gpsherry opened this issue Nov 11, 2020 · 9 comments

Comments

@gpsherry
Copy link

Hi
I run the demo and report one problem as the follow picture and my environment is follow:
cuda=10.1
pytorch=1.5
mmdetection=2.6
mmcv-full=1.15
Do you know the problem?
WX20201111-173042@2x

@hyz-xmaster
Copy link
Owner

I guess this problem may be caused by the incompatibility of your mmcv and pytorch. If you have used this command pip install mmcv-full==latest+torch1.6.0+cu101 -f https://download.openmmlab.com/mmcv/dist/index.html to install mmcv, then your pytorch version should be 1.6.0. Alternatively, you can change the command to pip install mmcv-full==latest+torch1.5.0+cu101 -f https://download.openmmlab.com/mmcv/dist/index.html to be compatible with your version of pytorch. By the way, if you reinstall mmcv, you need first uninstall the installed one.

@gpsherry
Copy link
Author

Hi
The method of you say is not useful, the same error is report. but i modify the mmdet/api/inference.py of device = next(model.parameters()).device to device = 0 ,then the problem is not occur.why??The code is have bug?

hyz-xmaster added a commit that referenced this issue Nov 11, 2020
@hyz-xmaster
Copy link
Owner

Hi, the problem is fixed. Please see the commit. It is caused by one bug of MMDetection.

@gpsherry
Copy link
Author

gpsherry commented Nov 11, 2020

Hi
I meet a new problem during the train model process, the train information of Loss is not print.

WX20201111-200333@2x

@hyz-xmaster
Copy link
Owner

Hi
I meet a new problem during the train model process, the train information of Loss is not print.

WX20201111-200333@2x

I have no experience of using TensorBoard to view the training process. You may use dict(type='TextLoggerHook') instead of dict(type='TensorboardLoggerHook') to monitor the training.

@gpsherry
Copy link
Author

I have used dict(type='TextLoggerHook') in the first time, but i can not see the train log, so i use dict(type='TensorboardLoggerHook') and can not see train log. now, i don't know my train problem that the mode is convergent or not.

@hyz-xmaster
Copy link
Owner

Are you running the training on a very small dataset? If it is, setting interval=1 should print the log. By the way, I do not think it will work to train such big a network on a that small dataset.

@gpsherry
Copy link
Author

I test in original mmdetection code in my dataset is no problem and i debug your code find you train_pipeline load trian data len is 1 but test_pipeline is no problem.

@gpsherry
Copy link
Author

Hi
The train log problem is over, the coco.py add the filter of CLASSNAME and modify the classname is execute no matter.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants