Associate
- Joined
- 9 Mar 2008
- Posts
- 1,039
Hi all - i set up the linux client (on Ubuntu) in VMware in the last few days and it has been crunching away happily, and tonight finished its first WU. However, i have had a few problems... The WU finished, and the client shutdown, but the terminal window showed a message along the lines of 'no access to mpich' - i think (unforutnately i did not keep the screen open to write it down, and FAHmon and the log files do not seem to have recorded the error). The log file reads as follows:
The client now does not seem to be able to do anything (keeps repeating the bottom section of the code), which is a pain. I am loathed to delete the work folder etc since i have no idea if it sent the finished unit, or how i can solve this problem if it happens again. Any help would be greatly appreciated - im pretty new to this VMware thing!
Code:
[23:20:16] Completed 247500 out of 250000 steps (99%)
[23:30:51] Completed 250000 out of 250000 steps (100%)
[23:31:52]
[23:31:52] Finished Work Unit:
[23:31:52] - Reading up to 21141072 from "work/wudata_01.trr": Read 21141072
[23:31:54] trr file hash check passed.
[23:31:54] - Reading up to 27623324 from "work/wudata_01.xtc": Read 27623324
[23:31:55] xtc file hash check passed.
[23:31:55] edr file hash check passed.
[23:31:55] logfile size: 181538
[23:31:55] Leaving Run
[23:31:59] - Writing 49176798 bytes of core data to disk...
[23:31:59] ... Done.
[23:32:09] - Shutting down core
[23:32:09]
[23:32:09] Folding@home Core Shutdown: FINISHED_UNIT
[23:34:12] ***** Got an Activate signal (2)
[23:34:12] Killing all core threads
Folding@Home Client Shutdown.
--- Opening Log file [October 31 23:34:20]
# SMP Client ##################################################################
###############################################################################
Folding@Home Client Version 6.02
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: /home/chris/folding
Executable: ./fah6
Arguments: -smp -verbosity 9 -oneunit
[23:34:20] - Ask before connecting: No
[23:34:20] - User name: ChrissyT88 (Team 10)
[23:34:20] - User ID: 6863B6A61C0C450E
[23:34:20] - Machine ID: 1
[23:34:20]
[23:34:20] Loaded queue successfully.
[23:34:20] - Autosending finished units...
[23:34:20] Trying to send all finished work units
[23:34:20] + No unsent completed units remaining.
[23:34:20] - Autosend completed
[23:34:20]
[23:34:20] + Processing work unit
[23:34:20] Core required: FahCore_a2.exe
[23:34:20] Core found.
[23:34:20] Working on Unit 01 [October 31 23:34:20]
[23:34:20] + Working ...
[23:34:20] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 01 -checkpoint 15 -verbose -lifeline 6449 -version 602'
[23:34:20]
[23:34:20] *------------------------------*
[23:34:20] Folding@Home Gromacs SMP Core
[23:34:20] Version 2.01 (Wed Aug 13 13:11:25 PDT 2008)
[23:34:20]
[23:34:20] Preparing to commence simulation
[23:34:20] - Ensuring status. Please wait.
[23:34:20] Files status OK
[23:34:20]
[23:34:20] Project: 0 (Run 0, Clone 0, Gen 0)
[23:34:20]
[23:34:20] Error: Could not write local file. Exiting.
[23:34:20] - Shutting down core
[23:34:30] one 0, Gen 0)
[23:34:30]
[23:34:30] Error: Could not write local file. Exiting.
[23:34:30] - Shutting down core
The client now does not seem to be able to do anything (keeps repeating the bottom section of the code), which is a pain. I am loathed to delete the work folder etc since i have no idea if it sent the finished unit, or how i can solve this problem if it happens again. Any help would be greatly appreciated - im pretty new to this VMware thing!
Last edited: