Questions and Answers : Windows : Some Rosetta files do not progress anymore
Author | Message |
---|---|
beguwe1 Send message Joined: 7 Mar 10 Posts: 1 Credit: 178,223 RAC: 0 |
Hi, I am having the following issue sometimes on more than one PC: - some Rosetta files progress till a certain % - then the processing time is still increasing, but the progress % indicator stays as-is. - so no progress anymore on the file. -if the 'estimated time to complete' is at zero, there is a dash (-) placed in that column. - rebooting the PC does not help, only 'work aboard' helps. ==> anybody knows the root cause of this issue? (i have BOINC version 7.6.9 (Windows x64) Best Regards, /Guy Belgium |
dsalt2 Send message Joined: 30 Apr 07 Posts: 1 Credit: 3,760,856 RAC: 25 |
I have exactly the same problem! Tasks get to a certain percentage then stop going up in % but the run-time carries on increasing for ever and never stops. Rebooting doesn't work, I end up having to abort the tasks. HELP! Rosetta 3.73 Boinc 7.6.22 (x64) Windows 10 x64 |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
If rebooting the machine does not get things going well again, then it is not the problem I often see where BOINC Manager does not actually give the thread CPU time (even though it shows the task as "running"). So this may be an issue with specific tasks. A Link to the task and the task name might be helpful. And the Number Crunching board has many more readers than the Q&A boards, so I'd suggest posting the details there. Is it possible that the BOINC Manager ran out of disk space available? Rosetta Moderator: Mod.Sense |
Mark Send message Joined: 11 Oct 16 Posts: 8 Credit: 363,827 RAC: 0 |
Bump. Stuck tasks: The recommended first-aid is to hit "Reset project," which removes all pending tasks. After that, a new batch of tasks arrives; 6 of them begin to run; one finishes within a few minutes and uploads results to headquarters; the others get stuck. Stuck tasks load gigabytes of data into RAM, where it remains even when the tasks are suspended. Screen grab from Windows Task Manager, identical for running and suspended tasks: https://postimg.org/image/fherq5zcp/ Copious unused disk space and RAM space remains. ----- Windows 10 Home 64-bit, x-64-based processor Rosetta Mini 3.73 BOINC Manager 7.6.22 (x64) |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
Your screenshot shows 0% CPU going to the tasks. The BOINC Manager determines what gets CPU and what doesn't. The memory they consume is normal. And if the tasks are not running, then Windows will swap them out of memory. But if your BOINC settings do not allow BOINC to use more memory, that could prevent tasks from progressing. There are a number of possibilities for why tasks might not be getting CPU. If the screenshot doesn't show all of the BOINC work units, perhaps other work units are running and using all of the memory BOINC is allowed to use. Also, if there are other, non-BOINC, tasks keeping the machine busy, BOINC Manager has a threshold of CPU % busy where once your machine gets too busy, it steps out of the way and suspends BOINC activity until the machine is less busy. I'd suggest completely ending BOINC Manager ("exit", not "close"), and restarting. Then check the BOINC Manager events log and the tasks display in the advanced view to see if there are indications of why tasks are being suspended. Also, if you could explain more about your BOINC settings for the computing preferences, that would help put things in perspective. Rosetta Moderator: Mod.Sense |
Mark Send message Joined: 11 Oct 16 Posts: 8 Credit: 363,827 RAC: 0 |
Thanks for the reply. The screenshot shows the typical situation when BOINC Manager says that 6 tasks are running. The CPU is at 0% overall, not just for Rosetta tasks, so plenty of CPU is available. RAM shows as 39% in use -- that's 61% or 9.6 GB free, so plenty of RAM is available. And 105 GB of disk space is available. Exiting and restarting BOINC Manager doesn't help. Rebooting the computer doesn't help. After an exit-restart, the Event Log reports "Resuming computation" and "Resuming network activity" and gives no indication of a problem. SETTINGS Use at most 75% of the CPUs Use at most 75% of CPU time Suspend when computer is in use 'In use" means...3 minutes Suspend when non-BOINC CPU usage is above 25% Network: default settings Disk: Leave at least 0.1 GB free Disk: Use no more than 90% of total Memory: When computer is in use, use at most 50% [not applicable] Memory: When computer is not in use use, use at most 90% Memory: Page/swap file: use at most 75% Daily schedules: default settings (blank) |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2474 Credit: 46,506,558 RAC: 3,757 ![]() |
Thanks for the reply. Try these changes: Use at most 100% of the CPUs Use at most 100% of CPU time Memory: When computer is in use, use at most 65% Leave non-GPU tasks in memory when suspended - Tick Did you say how much RAM you have? ![]() ![]() |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
Did you say how much RAM you have? They have 8 CPUs (with max 75% of CPUs) and 16GB. Is there a known bug in BOINC Manager? Seems it should always have at least one task running if memory permits (and it does). Rosetta Moderator: Mod.Sense |
Mark Send message Joined: 11 Oct 16 Posts: 8 Credit: 363,827 RAC: 0 |
Did you say how much RAM you have? Correct. Re. your list in an earlier post of circumstances that cause BOINC Manager to deny CPU to tasks: My expectation is that if BOINC Manager purposely suspends activity, for the reasons you suggest or any other, then the UI ought to reflect this state of affairs. The "Status" column should say something other than "Running," the "Elapsed" column should stop ticking off the seconds, and the event log should give some hint. None of that happened, I think, for the users posting in this thread. I am thrilled and grateful that you and Sid are giving this matter some attention. Before posting here, I tried raising the issue in 3 other locations (BOINC, MilkyWay, email support) and received no response. So thanks. |
Mark Send message Joined: 11 Oct 16 Posts: 8 Credit: 363,827 RAC: 0 |
RAM: 16.0 GB All right. I uninstalled BOINC, cleaned up, downloaded a fresh installer, installed BOINC, and (sceptically) applied your suggested settings. Also changed "Suspend when non-BOINC CPU usage is above" to 40% from 25%. (The original settings were copied from a 5-year-old, much less powerful Windows PC, which was running BOINC to perfection, though glacially.) So far, so good. BOINC and Rosetta appear to be operating as expected. ---------- Rosetta Mini 3.73 BOINC Manager 7.6.22 (x64) --- Windows 10 Home 64-bit, x-64-based processor, English (Canada) -- ASUS Desktop PC M32CD Series Intel(R) Core(TM) i7-6700 CPU @ 3.40 GHz Skylake 14nm Technology Motherboard: ASUSTeK COMPUTER INC. M32CD_A_F_K20CD_K31CD (LGA1151) RAM: 16.00 GB GPU: AMD Radeon R9 370 -- Windows Defender Windows Firewall WinPatrol PLUS |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2474 Credit: 46,506,558 RAC: 3,757 ![]() |
Is there a known bug in BOINC Manager? Don't even go there... ;) ![]() ![]() |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2474 Credit: 46,506,558 RAC: 3,757 ![]() |
Try these changes: I also have an 8-core machine with 16Gb ram, but an AMD. By and large everything should be the same between us. Also changed "Suspend when non-BOINC CPU usage is above" to 40% from 25%. (The original settings were copied from a 5-year-old, much less powerful Windows PC, which was running BOINC to perfection, though glacially.) I didn't notice this first time round. While the other changes are worth doing, this seems more significant. Given that all Rosetta tasks run at low-priority, and all non-Boinc tasks run at a higher setting (Normal) this setting seems redundant. It might've been necessary with an underpowered PC, but imo serves no purpose with the power of your current machine. All non-Boinc processing has a higher priority from the first 1%. Suspending tasks altogether when it gets above a certain point seems like overkill to me. I have this option unticked and don't notice a problem. Why waste time even checking, given the above? See if you notice any difference (though your change may be sufficient in itself). ![]() ![]() |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
My expectation is that if BOINC Manager purposely suspends activity, for the reasons you suggest or any other, then the UI ought to reflect this state of affairs. The "Status" column should say something other than "Running," the "Elapsed" column should stop ticking off the seconds, and the event log should give some hint. None of that happened, I think, for the users posting in this thread. Agreed. I believe when the problem occurs, the elapsed time increases, but in the properties of the tasks, you'll see that the CPU time is NOT increasing. This also results in outrageous "estimated" time to completion (since this is just computed from the fraction done and the elapsed time). If Sid's suggested settings continue to work for you (and you have time and interest to tinker further), I'd suggest changing settings one-at-a-time until you see the zombie tasks again. The most significant to your environment I should think would be the 75% of the CPUs, thus leaving you two CPUs always available for non-BOINC activity. You must have desired that setting to keep a reserve for some other activity on the machine. Rosetta Moderator: Mod.Sense |
Mark Send message Joined: 11 Oct 16 Posts: 8 Credit: 363,827 RAC: 0 |
...I'd suggest changing settings one-at-a-time until you see the zombie tasks again. Here are the results. Each of the options referred to in this thread was reverted, one at a time, to its previous setting. The culprit turned out to be "Use at most 75% of CPU time." At that setting, all heck broke loose. CPU utilization showed wild swings, overall and for each Rosetta task. After a minute, one of those tasks went to 0% and turned zombie as described above. Life returned when the setting was changed back to 100%, but only after BOINC Manager was closed and the computer rebooted. |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
So you have two CPU settings, 75% of the number of CPUs, and 75% of total CPU power. So were you able to get the 75% of CPUs (i.e. 6 CPUs) running at 100%? Rosetta Moderator: Mod.Sense |
Mark Send message Joined: 11 Oct 16 Posts: 8 Credit: 363,827 RAC: 0 |
So you have two CPU settings, 75% of the number of CPUs, and 75% of total CPU power. So were you able to get the 75% of CPUs (i.e. 6 CPUs) running at 100%? On my system, when BOINC is configured for 75% of CPUs and the evidently mandatory 100% of CPU power, BOINC projects receive a steady 75% of CPU and each task gets about 12.5% (75/6, 100/8). My system can't handle the "75% of CPU power" setting, as described above. |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2474 Credit: 46,506,558 RAC: 3,757 ![]() |
...I'd suggest changing settings one-at-a-time until you see the zombie tasks again. A long time ago, using a very different machine and a much older Boinc, I set a reduced %age of CPU time and noticed that what actually happened was 100% of CPU time was used for (say) 75% of the time and 0% for the remainder of the time, switching multiple times per second. I always knew it was a weird setting, though I didn't know it would do what you've reported. Interesting to note. So you're happy with how things are going? Good to hear, if so. ![]() ![]() |
Mark Send message Joined: 11 Oct 16 Posts: 8 Credit: 363,827 RAC: 0 |
A long time ago, using a very different machine and a much older Boinc, I set a reduced %age of CPU time and noticed that what actually happened was 100% of CPU time was used for (say) 75% of the time and 0% for the remainder of the time, switching multiple times per second. I always knew it was a weird setting, though I didn't know it would do what you've reported. Interesting to note. Ah, that may explain what I was seeing in the Windows Task Manager at the 75% setting. Happy? Yes. Although if a reduced CPU time setting ever becomes workable for me, I certainly want to take advantage -- to lower CPU temperature. |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
The only means BOINC Manager has of implementing "75% of CPU time" is to go 100% for a period of time and then go zero for a period of time to average out to 75%. And so far as CPU temps go, that's all you need. The temps average out the same way the CPU % does. Computers don't just run at 75%. They're either on or off. Even when there is nothing to do, it has to spin the extra cycles in the idle task. The gotta go somewhere. When using 6 out of the 8 CPUs (i.e. 75% of the processors rather than of the CPU) I'm guessing the tasks settle in to specific CPUs and don't leave. In other words you have 6 CPUs running 100% of the time, and two that are only supporting operating system overhead and other non-BOINC tasks. The good news is most systems are properly designed to deliver sufficient airflow to keep the CPUs are a healthy temp. Rosetta Moderator: Mod.Sense |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2474 Credit: 46,506,558 RAC: 3,757 ![]() |
A long time ago, using a very different machine and a much older Boinc, I set a reduced %age of CPU time and noticed that what actually happened was 100% of CPU time was used for (say) 75% of the time and 0% for the remainder of the time, switching multiple times per second. I always knew it was a weird setting, though I didn't know it would do what you've reported. Interesting to note. I'm not sure I ever saw it in the Task Manager. I used to use a sidebar gadget under Vista at one time that revealed it. Temperatures are certainly an issue. I'm overclocking and use HWInfo64 to keep an eye on that. If the 75% of CPU's setting (for 6 of 8 cores) worked ok for you (others report it working fine too) wouldn't that knock the overall temperature down? I don't know as I've never tried it but definitely worth experimenting with for a few days. A good halfway-house setting and more like what you were setting out to do in the first place it seems to me. ![]() ![]() |
Questions and Answers :
Windows :
Some Rosetta files do not progress anymore
©2025 University of Washington
https://www.bakerlab.org