Help with Win SMP

Hi Guys,

I've been running the Win SMP client for just over a week now and I seem to have hit a problem. I just finished my 13th WU and I seem to be stuck in a loop (see log below). I have closed the session down with Ctrl+C and rebooted, but when I start up fah.exe I just get this:

Not sure what to do to get going again. Don't really want to delete the WU as it is my first 2651 @ 1760 points and it completed 100%. If anyone can help please do.

Cheers

Starting FAH for the first time
-------------------------------


Note: Please read the license agreement (fah.exe -license). Further
use of this software requires that you have read and accepted this agreement.

Using local directory for work files
2 cores detected


--- Opening Log file [April 11 21:33:03]


# SMP Client ##################################################################
###############################################################################

Folding@Home Client Version 5.91beta

http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Program Files (x86)\Folding@Home Windows SMP Client V1.01
Executable: C:\Program Files (x86)\Folding@Home Windows SMP Client V1.01\fah.exe

Arguments: -oneunit -verbosity 9 -local

[21:33:03] - Ask before connecting: No
[21:33:03] - User name: xgeek (Team 10)
[21:33:03] - User ID: 1EF0545C774269C5
[21:33:03] - Machine ID: 1
[21:33:03]
[21:33:03] Loaded queue successfully.
[21:33:03] - Preparing to get new work unit...
[21:33:03] + Attempting to get work packet
[21:33:03] - Autosending finished units...
[21:33:03] - Will indicate memory of 2045 MB
[21:33:03] Trying to send all finished work units
[21:33:03] - Connecting to assignment server
[21:33:03] Connecting to http://assign.stanford.edu:8080/


[21:33:03] + Attempting to send results
[21:33:03] - Reading file work/wuresults_01.dat from core
[21:33:03] (Read 5512621 bytes from disk)
[21:33:03] Connecting to http://171.64.65.64:8080/
[21:33:05] Posted data.
[21:33:05] Initial: 0000; - Successful: assigned to (0.0.0.0).
[21:33:05] + News From Folding@Home: Welcome to Folding@Home
[21:33:05] Work Unit has an invalid address.
[21:33:05] - Error: Attempt #1 to get work failed, and no other work to do.
Waiting before retry.
[21:33:06] - Couldn't send HTTP request to server
[21:33:06] + Could not connect to Work Server (results)
[21:33:06] (171.64.65.64:8080)
[21:33:06] - Error: Could not transmit unit 01 (completed April 11) to work server.
[21:33:06] - 7 failed uploads of this unit.


[21:33:06] + Attempting to send results
[21:33:06] - Reading file work/wuresults_01.dat from core
[21:33:06] (Read 5512621 bytes from disk)
[21:33:06] Connecting to http://171.65.103.100:8080/
[21:33:15] + Attempting to get work packet
[21:33:15] - Will indicate memory of 2045 MB
[21:33:15] - Connecting to assignment server
[21:33:15] Connecting to http://assign.stanford.edu:8080/
[21:33:16] Posted data.
[21:33:16] Initial: 0000; - Successful: assigned to (0.0.0.0).
[21:33:16] + News From Folding@Home: Welcome to Folding@Home
[21:33:16] Work Unit has an invalid address.
[21:33:16] - Error: Attempt #2 to get work failed, and no other work to do.
Waiting before retry.
[21:33:28] + Attempting to get work packet
[21:33:28] - Will indicate memory of 2045 MB
[21:33:28] - Connecting to assignment server
[21:33:28] Connecting to http://assign.stanford.edu:8080/
[21:33:30] Posted data.
[21:33:30] Initial: 0000; - Successful: assigned to (0.0.0.0).
[21:33:30] + News From Folding@Home: Welcome to Folding@Home
[21:33:30] Work Unit has an invalid address.
[21:33:30] - Error: Attempt #3 to get work failed, and no other work to do.
Waiting before retry.
[21:33:51] + Attempting to get work packet
[21:33:51] - Will indicate memory of 2045 MB
[21:33:51] - Connecting to assignment server
[21:33:51] Connecting to http://assign.stanford.edu:8080/
[21:33:53] Posted data.
[21:33:53] Initial: 0000; - Successful: assigned to (0.0.0.0).
[21:33:53] + News From Folding@Home: Welcome to Folding@Home
[21:33:53] Work Unit has an invalid address.
[21:33:53] - Error: Attempt #4 to get work failed, and no other work to do.
Waiting before retry.
 
Servers are down. Lots of people bemoaning it on the official forums.

Nothing to do but wait :(
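
If you want to check your own log for the same pattern, here is a minimal Python sketch. It assumes the log lines are worded like the ones posted above; `summarise_log` and the sample text are purely illustrative and not part of the FAH client.

```python
import re

# Patterns copied from the log lines posted above; other client versions may word them differently.
ATTEMPT_RE = re.compile(r"Attempt #(\d+) to get work failed")
UPLOAD_RE = re.compile(r"(\d+) failed uploads of this unit")
BAD_ASSIGN = "assigned to (0.0.0.0)"


def summarise_log(text):
    """Return counts of the failure markers seen in a client log (hypothetical helper)."""
    attempts = [int(n) for n in ATTEMPT_RE.findall(text)]
    uploads = [int(n) for n in UPLOAD_RE.findall(text)]
    return {
        "last_failed_work_attempt": max(attempts, default=0),
        "failed_uploads": max(uploads, default=0),
        "null_assignments": text.count(BAD_ASSIGN),
    }


sample = """\
[21:33:05] Initial: 0000; - Successful: assigned to (0.0.0.0).
[21:33:05] - Error: Attempt #1 to get work failed, and no other work to do.
[21:33:06] - 7 failed uploads of this unit.
[21:33:16] Initial: 0000; - Successful: assigned to (0.0.0.0).
[21:33:16] - Error: Attempt #2 to get work failed, and no other work to do.
"""

print(summarise_log(sample))
```

In this thread, repeated 0.0.0.0 assignments together with failed uploads turned out to be a server-side outage rather than anything wrong with the client setup.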
 
I had 4 PCs yesterday that weren't uploading results and also couldn't get new work. In addition I had 5 others that had sent completed WUs but couldn't get new work. I'm now down to 1 that can't send or receive and 4 that aren't receiving... slow progress.

I sometimes think that if members of the folding community did a bit more donating to Stanford and a bit less spending on personal rigs, the project would benefit a lot. Then again, shouldn't a university like Stanford be able to get the small amount of funding needed to improve its servers and infrastructure? Especially as FAH is pretty high profile.

I should put my money where my mouth is and donate a tenner I guess.
 
verbal said:
I should put my money where my mouth is and donate a tenner I guess.
I'll be keeping my tenner for now - at least until the long overdue leccy bill arrives (we think they've forgotten about us after we switched from PAYG meter to a normal one) :o :p

New server hardware is already on its way to Stanford so there should be some improvement shortly... then V6 can be released and we can complain about that instead :D
 
rich99million said:
I'll be keeping my tenner for now - at least until the long overdue leccy bill arrives (we think they've forgotten about us after we switched from PAYG meter to a normal one) :o :p

The key is to keep asking for a bill (sending your correct details), and that way you get 2 years of free leccy! Worked for me in my last place; then I moved house and now we have to pay for what we use like most people. :rolleyes:

rich99million said:
New server hardware is already on its way to Stanford so there should be some improvement shortly... then V6 can be released and we can complain about that instead :D

You've got to love the British and their need to grumble ;)
 
Still seems to be playing up. My WU from yesterday has not uploaded yet and I am about 25% through the next WU.

I'm still new to all this, but should the completed WU upload before the client grabs a new one? Is there a way I can force an upload, or will it try again in its own time?

Cheers
 
xgeek said:
Still seems to be playing up. My WU from yesterday has not uploaded yet and I am about 25% through the next WU.

I'm still new to all this, but should the completed WU upload before the client grabs a new one? Is there a way I can force an upload, or will it try again in its own time?

Cheers
The client will auto-send completed work every 6 hours. Currently some servers are down because the server room is having a new UPS installed (kinda ironic, eh? :p), and the servers that are still running are so heavily loaded that not a lot is happening.

The downed servers should be back up this afternoon (morning at Stanford) though I would expect it to take quite a while to clear the backlog - best just to leave the client to do it automatically rather than forcing it.

The client has a 10-position queue which it can fill with completed WUs before it becomes a problem, so you're fine for now :)
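
As a rough back-of-envelope sketch in Python: taking the 10-position queue at face value, and assuming each finished WU simply occupies one slot until it uploads (a simplified model, not confirmed client behaviour), you can estimate how long uploads can stay down before the queue fills. The function name and the 30-hour WU time are made up for illustration.

```python
def hours_until_queue_full(waiting_results, hours_per_wu, queue_size=10):
    """Estimate hours of folding left before every queue slot holds an unsent result.

    Simplified model: each finished WU occupies one slot until it is uploaded.
    """
    if not 0 <= waiting_results <= queue_size:
        raise ValueError("waiting_results must be between 0 and queue_size")
    free_slots = queue_size - waiting_results
    return free_slots * hours_per_wu


# Hypothetical numbers: 1 result stuck in the queue, roughly 30 hours per SMP unit.
# 9 free slots * 30 h = 270 h of headroom under this model.
print(hours_until_queue_full(waiting_results=1, hours_per_wu=30))
```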

edit: I've just read that they also have some servers that have gone down unexpectedly, possibly a networking issue, so these will probably be back up and working this afternoon too.
That new server hardware really can't come soon enough :o :p
 
Having the same problem here at work. My SMP unit finished at 2am this morning. At least it's crunching its way through the next unit (loving that batch file :D )

[03:41:15] Completed 20000 out of 1000000 steps (2 percent)
[03:55:45] Couldn't send HTTP request to server (wininet)
[03:55:45] + Could not connect to Work Server (results)
[03:55:45] (171.64.65.64:8080)
[03:55:45] - Error: Could not transmit unit 01 (completed April 13) to work server.
[03:55:45] - 3 failed uploads of this unit.


[03:55:45] + Attempting to send results
[03:55:45] - Reading file work/wuresults_01.dat from core
[03:55:45] (Read 27055656 bytes from disk)
[03:55:57] Error: Got status code 502 from server
[03:55:57] + Could not connect to Work Server (results)
[03:55:57] (171.65.103.100:8080)
[03:55:57] Could not transmit unit 01 to Collection server; keeping in queue.
[03:55:57] + Sent 0 of 1 completed units to the server
[03:55:57] - Autosend completed

What happens if a unit is uploaded after the deadline, even though it was completed beforehand?
 
Phew!!!... xgeek has been gaining on me all week!!! Gives me a chance to pull away ;)

Hope you sort it though :)

Stelly
 
shadowscotland said:
Having the same problem here at work. My SMP unit finished at 2am this morning. At least it's crunching its way through the next unit (loving that batch file :D )

What happens if a unit is uploaded after the deadline, even though it was completed beforehand?

You get 0 points, I'm afraid. I'd try turning the "use IE settings" option off, since the reason you couldn't contact the work server was an IE networking issue (the wininet error in your log).

However, it may take a few goes to get the WU through, since most of the servers that are online are currently taking a serious battering thanks to some networking problems at Stanford.
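
To tell the two kinds of failure in that log apart, something like this rough Python sketch could help. The category labels are my own wording, not FAH terminology, and the matching relies only on the strings visible in the posted log.

```python
def classify_upload_error(line):
    """Map a log line to a rough failure category, or None if it is not a known error line."""
    if "(wininet)" in line:
        # Request went through the Windows/IE networking layer and could not be sent.
        return "wininet: try turning 'use IE settings' off"
    if "status code 502" in line:
        # HTTP 502 is a Bad Gateway response, i.e. trouble on the server side.
        return "HTTP 502: server side, wait and retry"
    if "Could not connect to Work Server" in line:
        return "connection failed"
    return None


sample = [
    "[03:55:45] Couldn't send HTTP request to server (wininet)",
    "[03:55:57] Error: Got status code 502 from server",
    "[03:55:57] - Autosend completed",
]

for line in sample:
    print(classify_upload_error(line), "<-", line)
```

Only the first kind is something you can do anything about locally; the 502 just means waiting for the servers to recover.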
 
uncle_fungus said:
You get 0 points I'm afraid.
:eek:

uncle_fungus said:
However, it may take a few goes to get the WU through, since most of the servers that are online are currently taking a serious battering thanks to some networking problems at Stanford.

[09:37:05] Completed 210000 out of 1000000 steps (21 percent)
[09:53:05] Timered checkpoint triggered.
[09:55:57] - Autosending finished units...
[09:55:57] Trying to send all finished work units


[09:55:57] + Attempting to send results
[09:55:57] - Reading file work/wuresults_01.dat from core
[09:55:57] (Read 27055656 bytes from disk)
[09:56:08] Writing local files
[09:56:08] Completed 220000 out of 1000000 steps (22 percent)

The oddities continue: no thank-you or failed message (as no points were given, I guess it failed). The deadline doesn't run out until 12 noon tomorrow, so that's another 4 auto-uploads before nul points!
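
The count of 4 can be checked with a small Python sketch, assuming the 6-hour auto-send interval mentioned earlier in the thread and counting only scheduled auto-sends. The year is a placeholder, since the log only shows times; `autosends_before` is a made-up helper name.

```python
from datetime import datetime, timedelta


def autosends_before(last_autosend, deadline, interval_hours=6):
    """List scheduled auto-send times after last_autosend that fall on or before deadline."""
    step = timedelta(hours=interval_hours)
    times = []
    nxt = last_autosend + step
    while nxt <= deadline:
        times.append(nxt)
        nxt += step
    return times


# Placeholder year; last auto-send taken from the log above, deadline is noon the next day.
last = datetime(2007, 4, 13, 9, 55, 57)
deadline = datetime(2007, 4, 14, 12, 0, 0)

attempts = autosends_before(last, deadline)
for t in attempts:
    print(t.strftime("%Y-%m-%d %H:%M:%S"))
print(len(attempts), "scheduled auto-sends before the deadline")
```

That gives attempts at 15:55, 21:55, 03:55 and 09:55, with the following one landing after noon.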
 
rich99million said:
It could still be uploading - people have reported upload speeds of 1KB/s to the servers at the moment :eek:

Really that slow :( - It didn't go

[10:53:08] Completed 250000 out of 1000000 steps (25 percent)
[10:56:03] Couldn't send HTTP request to server (wininet)
[10:56:03] + Could not connect to Work Server (results)
[10:56:03] (171.64.65.64:8080)
[10:56:03] - Error: Could not transmit unit 01 (completed April 13) to work server.
[10:56:03] - 4 failed uploads of this unit.


[10:56:03] + Attempting to send results
[10:56:03] - Reading file work/wuresults_01.dat from core
[10:56:03] (Read 27055656 bytes from disk)
[10:56:16] Error: Got status code 502 from server
[10:56:16] + Could not connect to Work Server (results)
[10:56:16] (171.65.103.100:8080)
[10:56:16] Could not transmit unit 01 to Collection server; keeping in queue.
[10:56:16] + Sent 0 of 1 completed units to the server
[10:56:16] - Autosend completed
 
Hmm, just found that the Win SMP client on my server had stalled at around 7 am this morning (6 am in the logs). Looks like the inability to upload caused it all to hang, even though it probably got a new WU downloaded.

I was running it as a service (yep, I know you're not supposed to, but that's the only way I can do it, as a user isn't left logged in for security). I stopped and restarted it, and it's crunching a new WU happily, but there is no sign the old one got uploaded. I suspect it disappeared. :(
 
Oh dear, whatever the problem is with Stanford's servers, it's causing the SMP client to hang whenever a WU ends :( Nothing's getting uploaded either.

Restart the SMP client and it has a WU to start crunching, so downloads are working; it's just uploads that aren't.
 
Code:
[10:45:47] + Attempting to send results
[10:45:47] - Reading file work/wuresults_07.dat from core
[10:45:47]   (Read 5506529 bytes from disk)
[10:45:47] Connecting to http://171.65.103.100:8080/
[14:38:54] - Autosending finished units...
[14:38:54] Trying to send all finished work units
[14:38:54] - Already sending work
[14:38:54] + Sent 0 of 1 completed units to the server
[14:38:54] - Autosend completed
Guess it's still uploading, started at 10:45 lol.

Edit: nope, nothing happening with the network. Gonna restart it.

Edit2: restarted, sending now, and maxing out my upload. Woo
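
For spotting this kind of stall without eyeballing the log, here is a minimal Python sketch that finds the longest silence between timestamped lines. It assumes the `[HH:MM:SS]` prefix shown in the logs in this thread; `longest_gap` is a hypothetical helper, not a client feature.

```python
import re
from datetime import timedelta

STAMP_RE = re.compile(r"^\[(\d{2}):(\d{2}):(\d{2})\]")


def longest_gap(lines):
    """Return (gap, line_before, line_after) for the longest silence between timestamped lines."""
    best = (timedelta(0), None, None)
    prev_time = prev_line = None
    for line in lines:
        m = STAMP_RE.match(line)
        if not m:
            continue  # skip blank or wrapped lines without a timestamp
        h, mi, s = map(int, m.groups())
        now = timedelta(hours=h, minutes=mi, seconds=s)
        if prev_time is not None:
            gap = now - prev_time
            if gap < timedelta(0):
                gap += timedelta(days=1)  # log rolled past midnight
            if gap > best[0]:
                best = (gap, prev_line, line)
        prev_time, prev_line = now, line
    return best


sample = [
    "[10:45:47] + Attempting to send results",
    "[10:45:47] Connecting to http://171.65.103.100:8080/",
    "[14:38:54] - Autosending finished units...",
    "[14:38:54] - Already sending work",
]

gap, before, after = longest_gap(sample)
print(gap, "of silence after:", before)
```

On the snippet above it reports a gap of nearly four hours right after the "Connecting to" line, which is where the upload hung.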
 