Gauntlet 2009 Joiners Thread!

Stuart Gibson · 5 Oct 2009 at 18:22

The Server Status page would seem to indicate that things are looking a lot healthier now.

More in the green than not!

Stuart Gibson · 5 Oct 2009 at 18:46

Just the splitters left to kick now.

Uploads progressing nicely. :cool:

Might even get some new work tonight!

Marine Iguana · 5 Oct 2009 at 19:33

Woo Hoo got WU's downloading now

Edit: Damm got a power cut posting from phone aghh

loudbob · 5 Oct 2009 at 20:04

Gonna take some time to upload results & get more work....

I only had 6 hours work left.

Stuart Gibson · 5 Oct 2009 at 20:09

The data pipeline is maxxed out atm, so uploading and reporting is going to be iffy for quite some time.

It'll hopefully settle down by tomorrow lunchtime.

Marine Iguana · 5 Oct 2009 at 20:44

Sweet got power back on lol, that was a close one just started downloading WU's as well. Oh well

Stuart Gibson · 5 Oct 2009 at 21:01

Well, I've got everything uploaded and reported.

Only problem I've got now is no GPU work available for No.1 rig. :mad:

Guess all I can do is check it again in the morning before I go to work!

loudbob · 5 Oct 2009 at 21:06

Hope they get things sorted before I completly dry up of work.....

Stuart Gibson · 5 Oct 2009 at 21:19

Woo hoo!!

I got ONE GPU WU!! :rolleyes:

loudbob · 5 Oct 2009 at 21:26

Stuart Gibson said:
Woo hoo!!

I got ONE GPU WU!!

Congratulations,

Your prize tonight is approx 6 mins of extra work lol

Stuart Gibson · 5 Oct 2009 at 21:34

loudbob said:
Congratulations,

Your prize tonight is approx 6 mins of extra work lol

Yeah! rotf lmfao !!!!!

Stuart Gibson · 5 Oct 2009 at 21:41

Stuart Gibson said:
Yeah! rotf lmfao !!!!!

no... wait...

and there's more...

31 CPU, 31 GPU.

getting there... only a few hundred more to go ....

Keep 'em coming

loudbob · 5 Oct 2009 at 21:42

I guess things are improving then

Stuart Gibson · 5 Oct 2009 at 21:46

loudbob said:
I guess things are improving then

I hope so!

It's this damn dual GPU rig! :mad:

BOINC always seems to underestimate how much GPU work I need.

And a whole bunch of shorties will screw the whole thing up!! :eek:

loudbob · 5 Oct 2009 at 21:46

02/10/09 21:43:05 Internet access OK project servers may temporarily be down

At least you can get work

Marine Iguana · 5 Oct 2009 at 21:49

I got a hundred or so WU's downloaded so should keep me going for a bit

Edit: and still getting some

Stuart Gibson · 5 Oct 2009 at 21:54

loudbob said:
02/10/09 21:43:05 Internet access OK project servers may temporarily be down

At least you can get work

21:46:49 Project communication failed: attempting access to reference site
21:46:50 Internet access OK - project servers may be temporarily down.
21:46:52 SETI@home Scheduler request failed: Couldn't connect to server

21:48:02 SETI@home Scheduler request completed: got 11 new tasks

It's because everybody is trying to get work at the same time.

It just takes multiple attempts to get it!

Best thing to do is get a good nights kip, and let it take care of itself (easier said than done!)

loudbob · 5 Oct 2009 at 21:56

Stuart Gibson said:
Best thing to do is get a good nights kip, and let it take care of itself (easier said than done!)

Agreed........

I should have enough work (5 hours worth) until things settle down a bit.

Stuart Gibson · 5 Oct 2009 at 21:56

Marine Iguana said:
I got a hundred or so WU's downloaded so should keep me going for a bit

Edit: and still getting some

I'm not that lucky!

Berserker · 5 Oct 2009 at 22:18

Managed to get work everywhere that needed it (the Quad still had work so I left its network disabled - I'll pull the plug on that one later).

Laptop ran out of disk space (well, near enough for BOINC to complain). 6.8GB pagefile. Yup, that'll do it - and almost as big as the file that brought down SETI for the weekend...

Technical News said:
Okay that was an ugly weekend. On Saturday morning I came to realize that our master mysql database server (mork) had crashed. I was the only one available at the time so I came up to the lab and rebooted the thing. We really need to improve our remote kvm/power cycle situation. I babysat the reboot long enough to see that mysql was recovering, knowing though that the replica would be out of sync (and need to be regenerated from scratch during the next weekly backup).

But then everything else crashed, and also hard enough to require human intervention. This time Eric eventually came up on Sunday to try to reboot a series of servers, but to no avail - they kept locking up shortly after reboot.

So Monday morning (today) we came into the lab and started cleaning up the server situation. Eric finally found the cause of the latter, if not all, of our problems. We have a pseudo user account is the "user" that runs a lot of stuff, apache processes, cron jobs, some of the BOINC back end servers, etc. For some reason the .history file had grown to 8GB in size, and it was full of garbage. Not sure why just yet, but that meant every time one of the above processes started, the shell tried to read in this impossibly large history file. Oops. Once Eric deleted this file all these dams broke free and we were able to safely recover all the databases/etc. throughout our long morning.

- Matt