When scheduler stops scheduling, does it stop all projects or the one you are trying to restart?
Trace the contents of the scheduler log about project %s updated, status:%s, paused:%s, %d tasks to see if schduler knows that the project status has changed.
unknown project 如果 project 确实存在,是不应该出现的 not processing pack This is normal. After the scheduler restarts, the previously distributed tasks cannot be tracked. When starting, the scheduler needs to restore the status of all active tasks from the database. If there are many tasks, it will indeed be time-consuming.
This problem has been found. In the source code of pyspider, the status_count query of statusdb under mongodb under database is very slow when the amount of data is extremely large, which will cause the scheduler to start up very long
When scheduler stops scheduling, does it stop all projects or the one you are trying to restart?
Trace the contents of the scheduler log about
project %s updated, status:%s, paused:%s, %d tasks
to see if schduler knows that the project status has changed.unknown project
如果 project 确实存在,是不应该出现的not processing pack
This is normal. After the scheduler restarts, the previously distributed tasks cannot be tracked.When starting, the scheduler needs to restore the status of all active tasks from the database. If there are many tasks, it will indeed be time-consuming.
This problem has been found. In the source code of pyspider, the status_count query of statusdb under mongodb under database is very slow when the amount of data is extremely large, which will cause the scheduler to start up very long