Computer cluster: Difference between revisions

Content deleted Content added
No edit summary
Citation bot (talk | contribs)
Removed proxy/dead URL that duplicated identifier. | Use this bot. Report bugs. | #UCB_CommandLine 894/9214
Line 116:
 
===Debugging and monitoring===
Developing and debugging parallel programs on a cluster requires parallel language primitives and suitable tools such as those discussed by the ''High Performance Debugging Forum'' (HPDF) which resulted in the HPD specifications.<ref name="Prabhu"/><ref name=iosp>{{cite journal|last1=Francioni|first1=Joan M.|last2=Pancake|first2=Cherri M.|author-link2=Cherri M. Pancake|title=A Debugging Standard for High-performance computing|journal=Scientific Programming|volume=8|issue=2|date=April 2000|url=http://dl.acm.org/citation.cfm?id=1239906|doi=10.1155/2000/971291|publisher=IOS Press|issn=1058-9244|pages=95–108|___location=[[Amsterdam]], [[Netherlands]]|doi-access=free}}</ref> Tools such as [[Rogue Wave Software|TotalView]] were then developed to debug parallel implementations on computer clusters which use [[Message Passing Interface]] (MPI) or [[Parallel Virtual Machine]] (PVM) for message passing.
 
The [[University of California, Berkeley]] ''Network of Workstations'' (NOW) system gathers cluster data and stores them in a database, while a system such as PARMON, developed in India, allows visually observing and managing large clusters.<ref name="Prabhu"/>