<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Cluster Connection</title>
	<atom:link href="http://www.clusterconnection.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.clusterconnection.com</link>
	<description>Simplify HPC.  Share the knowledge.</description>
	<lastBuildDate>Thu, 11 Feb 2010 15:47:32 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Putting the &#8220;Work&#8221; Back in &#8220;Workstation&#8221;</title>
		<link>http://www.clusterconnection.com/2009/11/putting-the-work-back-in-workstation/</link>
		<comments>http://www.clusterconnection.com/2009/11/putting-the-work-back-in-workstation/#comments</comments>
		<pubDate>Fri, 20 Nov 2009 21:39:28 +0000</pubDate>
		<dc:creator>Ian Miller</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Hardware]]></category>
		<category><![CDATA[Products and Promotions]]></category>
		<category><![CDATA[Cluster]]></category>
		<category><![CDATA[Cray]]></category>
		<category><![CDATA[CX1]]></category>
		<category><![CDATA[Dell]]></category>
		<category><![CDATA[workstation]]></category>

		<guid isPermaLink="false">http://www.clusterconnection.com/?p=1644</guid>
		<description><![CDATA[Just back from SC09 in Portland – lots of &#8220;buzz&#8221; about Cray there, including our newest addition to the Cray CX family – the Cray CX1-iWS. If you haven&#8217;t heard about it yet, the iWS is an &#8220;integrated workstation&#8221; – a Microsoft Windows 7 workstation with a fully integrated HPC cluster running Microsoft Windows HPC [...]]]></description>
			<content:encoded><![CDATA[<p>Just back from SC09 in Portland – lots of &#8220;buzz&#8221; about Cray there, including our newest addition to the Cray CX family – the Cray CX1-iWS. If you haven&#8217;t heard about it yet, the iWS is an &#8220;integrated workstation&#8221; – a Microsoft Windows 7 workstation with a fully integrated HPC cluster running Microsoft Windows HPC Server 2008 – all in a deskside form factor. And with the launch of this product, we bring on board another powerhouse partner – Dell – who will be the exclusive reseller of the CX1-iWS. Go to the Dell site to check it out … <a href="http://content.dell.com/us/en/enterprise/hpcc-servers.aspx" target="_blank">http://content.dell.com/us/en/enterprise/hpcc-servers.aspx</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.clusterconnection.com/2009/11/putting-the-work-back-in-workstation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Does More Cores Mean Less Nodes?</title>
		<link>http://www.clusterconnection.com/2009/10/does-more-cores-mean-less-nodes/</link>
		<comments>http://www.clusterconnection.com/2009/10/does-more-cores-mean-less-nodes/#comments</comments>
		<pubDate>Fri, 09 Oct 2009 18:19:06 +0000</pubDate>
		<dc:creator>Douglas Eadline</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Featured]]></category>
		<category><![CDATA[HPC]]></category>
		<category><![CDATA[nodes]]></category>
		<category><![CDATA[Processor Cores]]></category>
		<category><![CDATA[revenue]]></category>

		<guid isPermaLink="false">http://www.clusterconnection.com/?p=1479</guid>
		<description><![CDATA[Packing cores into a node means less servers are needed, but the market is still growing.
Ever since the shift to multi-core processors began, I have always had a nagging question &#8211; Does more cores mean less nodes? I have wrestled with this question and finally realized that there is no simple answer. I should preface [...]]]></description>
			<content:encoded><![CDATA[<p><em>Packing cores into a node means less servers are needed, but the market is still growing.</em></p>
<p>Ever since the shift to multi-core processors began, I have always had a nagging question &#8211; <em>Does more cores mean less nodes?</em> I have wrestled with this question and finally realized that there is no simple answer. I should preface this post by stating that I am talking about HPC and not the general market place where multi-core and virtualization are all the rage.</p>
<p>To understand why I ask this, consider that back when Linux clustering began, the single core Pentium Pro and Dec Alpha where the two processors of choice. Large clusters were maybe 64 or 128 nodes (often tower cases), which translated into 64 or 128 cores. Today you can easily pack 128 cores into 16 nodes &#8211; not even a full rack chassis. Given this trend, is the node count for clusters getting smaller?</p>
<p>From my anecdotal evidence, I seem to notice several trends. First, the HPC market, after a downturn, seems to be growing again and is projected to reach $11.7 billion by 2012 (<a href="http://www.hpcwire.com/topic/systems/IDC-HPC-Will-Resume-Growth-After-Dipping-in-2009-38620187.html">IDC</a>). This revenue is up from a $9.6 billion figure for 2008. Thus, the hunger for nodes is increasing and not deceasing, which begs a further question. <em>Are node counts increasing or are more people buying clusters?</em> (i.e. instead of a few people buying larger clusters, are there a lot of people buying smaller manageable clusters.) For the marketing types out there, maybe you know the answer. If not, that is a good question to ask. Leave a comment and give us a clue.</p>
<p>Second, from my experience there is plenty of HPC work to go around. Whether nodes are in a large data center cluster or in a small local blade system, it seems the cores are busy. Perhaps we are seeing the rise of the <em>Closet Cluster? </em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.clusterconnection.com/2009/10/does-more-cores-mean-less-nodes/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Platform HPC Workgroup Manager v1.1</title>
		<link>http://www.clusterconnection.com/2009/10/platform-hpc-workgroup-manager-v11/</link>
		<comments>http://www.clusterconnection.com/2009/10/platform-hpc-workgroup-manager-v11/#comments</comments>
		<pubDate>Thu, 08 Oct 2009 22:47:05 +0000</pubDate>
		<dc:creator>Samu Gábor</dc:creator>
				<category><![CDATA[Products and Promotions]]></category>
		<category><![CDATA[Platform Cluster Manager PCM MPI Dynamic Provisioning Microsoft Windows]]></category>

		<guid isPermaLink="false">http://www.clusterconnection.com/?p=1594</guid>
		<description><![CDATA[Platform Computing is pleased to announce the availability of Platform HPC Workgroup Manager v1.1.
Platform HPC Workgroup Manager v1.1 is sold through Platform Computing partners and is for use in clusters containing 32-nodes or less.  The bundle includes the following:
Platform Cluster Manager (PCM) v1.2 – Standard Edition
Platform LSF Workgroup 7 Update 5
Platform MPI v5.6.6
The offering [...]]]></description>
			<content:encoded><![CDATA[<p>Platform Computing is pleased to announce the availability of Platform HPC Workgroup Manager v1.1.</p>
<p>Platform HPC Workgroup Manager v1.1 is sold through Platform Computing partners and is for use in clusters containing 32-nodes or less.  The bundle includes the following:</p>
<p>Platform Cluster Manager (PCM) v1.2 – Standard Edition<br />
Platform LSF Workgroup 7 Update 5<br />
Platform MPI v5.6.6</p>
<p>The offering includes the following capabilities:</p>
<p><strong>Dynamic Provisioning</strong><br />
Dynamic provisioning provides the ability to dynamically re-purpose nodes within a Platform LSF Workgroup cluster managed by Platform HPC Workgroup Manager.  The re-purposing is driven by LSF Workgroup (or manually) to match the OS configuration of the compute nodes according to the workload requirements. Support configurations include compute nodes configured as dual/multi-boot with Red Hat Enterprise Linux 5.3 and Microsoft Windows Server operating systems.</p>
<p><strong>Firstboot scripts</strong><br />
The Firstboot script functionality provides a simple means for IHVs to configure a series of scripts to be executed upon the initial boot of a PCM cluster headnode to prompt for key site specific information including network configuration, Platform LSF Workgroup and Platform MPI licensing.</p>
<p>For more details, please contact partnersales@platform.com (1-877-710-4522).</p>
]]></content:encoded>
			<wfw:commentRss>http://www.clusterconnection.com/2009/10/platform-hpc-workgroup-manager-v11/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Cluster Bookshelf</title>
		<link>http://www.clusterconnection.com/2009/10/the-cluster-bookshelf/</link>
		<comments>http://www.clusterconnection.com/2009/10/the-cluster-bookshelf/#comments</comments>
		<pubDate>Thu, 08 Oct 2009 22:46:44 +0000</pubDate>
		<dc:creator>Douglas Eadline</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Becker]]></category>
		<category><![CDATA[beowulf]]></category>
		<category><![CDATA[books]]></category>
		<category><![CDATA[Gropp]]></category>
		<category><![CDATA[reviews]]></category>
		<category><![CDATA[Sterling]]></category>

		<guid isPermaLink="false">http://www.clusterconnection.com/?p=1579</guid>
		<description><![CDATA[An all too short list of HPC cluster books

Every so often I am asked about books on Beowulf/HPC clustering. The good news is there are books available at several levels. The bad news is no one book covers everything because the number of topic areas is so vast. To make your search easier, I have [...]]]></description>
			<content:encoded><![CDATA[<p><em>An all too short list of HPC cluster books<br />
</em></p>
<p>Every so often I am asked about books on Beowulf/HPC clustering. The good news is there are books available at several levels. The bad news is no one book covers everything because the number of topic areas is so vast. To make your search easier, I have surveyed all of the cluster books I know about. I have not included some that are out of print or don&#8217;t really discuss HPC. Some of the books are dated, but still contain good general information. Others are just plain old, but are included here for completeness. If you know of any other books, please add a comment. My list is as follows:</p>
<p><a href="http://www.phy.duke.edu/~rgb/Beowulf/beowulf_book.php">Engineering a Beowulf-Style Compute Cluster</a> by Robert G. Brown<br />
A freely available book (pdf) that discusses building and designing Beowulf style clusters. Robert Brown is a long time contributor to the Beowulf/HPC cluster community.</p>
<p><a href="http://my.safaribooksonline.com/0131448536">Building Clustered Linux Systems</a> by Robert W. Lucke<br />
A very good overview of cluster computing methods and hardware. The book provides a rather wide coverage of options, but does not dive too deep into any one approach. It is somewhat Hewlett Packard focused as author works for HP. (648 Pages) ISBN: 0-13-144853-6</p>
<p><a href="http://www.sun.com/x64/ebooks/hpc.jsp">HPC For Dummies</a> (an ebook) by Douglas Eadline<br />
A very short introduction to HPC, but not for dummies either! The ebook is available for free from Sun Microsystems after registration. The book provides backgound and best practices. (authored by yours truly)</p>
<p><a href="http://mitpress.mit.edu/catalog/item/default.asp?ttype=2&amp;tid=9947">Beowulf Cluster Computing with Linux</a>, Second Edition by William Gropp, Ewing Lusk and Thomas Sterling<br />
Updated edition, now edited by William Gropp, Ewing Lusk (in addition to Sterling). Provides a good, but high level view of Linux clustering. This edition includes ROCKS and OSCAR coverage plus other important issues. (504 pages) ISBN 0-262-69292-9</p>
<p><a href="http://mitpress.mit.edu/catalog/item/default.asp?ttype=2&amp;tid=8681">Beowulf Cluster Computing with Linux</a> by Thomas Sterling<br />
The next book after the original &#8220;How to Build a Beowulf&#8221; by Tom Sterling (see below). Coverage is much expanded, but now very high level as chapters are written by key players in the cluster community. (536 pages) ISBN 0-262-69274-0</p>
<p><a href="http://mitpress.mit.edu/catalog/item/default.asp?ttype=2&amp;tid=8682">Beowulf Cluster Computing with Windows</a> by Thomas Sterling<br />
Approximately 75% of the material is the same as the Linux book by Sterling (no file system coverage). Newer versions of Windows HPC Server 2008 make this a bit dated. If you need Windows clusters, you may want to start here. (488 pages) ISBN 0-262-69275-9</p>
<p><a href="http://mitpress.mit.edu/catalog/item/default.asp?ttype=2&amp;tid=3898">How to Build a Beowulf</a> by Thomas Sterling, John Salmon, Donald J. Becker and Daniel F. Savarese<br />
The first book on Beowulf cluster computing. Published in 1999, it is now quite dated. Both hardware and software have moved past the text. It does provide good coverage of the issues facing the cluster builder. (261 pages) ISBN 0-262-69218-X</p>
<p><a href="http://www.amazon.com/exec/obidos/tg/detail/-/0138997098/ref=pd_cps_eb_1/104-1154502-8274348?v=glance&amp;s=books">In Search of Clusters</a> by Gregory Pfister<br />
A good (and very technical) book about the advantages of cluster architectures. Also provides a very detailed analysis of programming models  This book seems to be out of print. Worth reading if you can find a copy. It does not cover Linux Beowulf clusters. (608 pages) ISBN 0138997098</p>
<p><a href="http://oreilly.com/catalog/9780596005702/">High Performance Linux Clusters with OSCAR, Rocks, OpenMosix, and MPI</a> by Joseph D. Sloan<br />
O&#8217;Reilly&#8217;s second attempt at a Linux cluster book. Many feel this second attempt has missed the mark (again). Also note, the The OpenMosix Project has officially closed as of March 1, 2008. (367 pages) ISBN: 0-596-00570-9</p>
<p><a href="http://www.redbooks.ibm.com/abstracts/sg246041.html">Linux HPC Cluster Installation</a> An IBM Redbooks publication<br />
This document focus on xCAT (xCluster Administration Tools) for installation and administration. All nodes and components of the cluster, such as compute nodes and management nodes, are installed with xCAT. Dated material.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.clusterconnection.com/2009/10/the-cluster-bookshelf/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Getting Ahead Without Leaving People Behind</title>
		<link>http://www.clusterconnection.com/2009/10/getting-ahead-without-leaving-people-behind/</link>
		<comments>http://www.clusterconnection.com/2009/10/getting-ahead-without-leaving-people-behind/#comments</comments>
		<pubDate>Thu, 08 Oct 2009 22:46:17 +0000</pubDate>
		<dc:creator>addisonsnell</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[HPC cores multi-core many-core forecast survey data]]></category>

		<guid isPermaLink="false">http://www.clusterconnection.com/?p=1593</guid>
		<description><![CDATA[The HPC industry is known for pushing the latest technology, with the spotlight often looking beyond what’s new to start talking about what’s next. Before a petaflop system was installed, system architects began discussing what it would take to reach an exaflop, and new arenas like accelerators, solid state disks, and cloud computing dominate roundtable [...]]]></description>
			<content:encoded><![CDATA[<p>The HPC industry is known for pushing the latest technology, with the spotlight often looking beyond what’s new to start talking about what’s next. Before a petaflop system was installed, system architects began discussing what it would take to reach an exaflop, and new arenas like accelerators, solid state disks, and cloud computing dominate roundtable discussions.</p>
<p>In the processor world, that means we talk about not just multi-core, but many-core, and how users will take advantage of these vast arrays of computational power. But in our enthusiasm to embrace this abundance, it is important not to lose sight of the fact that the transition from single-core is far from complete.</p>
<p>Our 2009 High Performance Computing Site Census Survey found that as of April this year, 43% of the HPC installed base is single-core systems, making them still the leading processor type, ahead of dual-core (34%) and quad-core (20%).</p>
<p>To be sure, part of the reason for this is that some HPC users keep their systems in some form of employ for many years. The average life span of a cluster node is under three years, but many systems persist. We found about a quarter of the systems in our current census survey were installed before 2005, and over four-fifths of these were single-core.</p>
<p>Natural longevity isn’t the only contributing factor. First of all, some HPC users are still buying single-core systems. In our survey, one out of every seven systems installed since the beginning of 2008 was single core.</p>
<p>One potential reason for this might be that many users have not optimized their codes – or entered into ISV licensing agreements – to take multiple cores into account. This same phenomenon could lead HPC users to prolong the life of their existing single-core systems.</p>
<p>This is a significant, enduring issue for the HPC industry. The data points to a large population of the HPC market that may still help optimizing their environments for even dual-core, let alone quad, eight, and beyond.</p>
<p>Getting ahead of the competition is one part of the battle. Making sure your customers can keep up with you is the other.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.clusterconnection.com/2009/10/getting-ahead-without-leaving-people-behind/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Clustering in the Cloud</title>
		<link>http://www.clusterconnection.com/2009/10/clustering-in-the-cloud/</link>
		<comments>http://www.clusterconnection.com/2009/10/clustering-in-the-cloud/#comments</comments>
		<pubDate>Thu, 08 Oct 2009 22:45:49 +0000</pubDate>
		<dc:creator>Douglas Eadline</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Featured]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[grid]]></category>
		<category><![CDATA[HPC]]></category>
		<category><![CDATA[InfiniBand]]></category>
		<category><![CDATA[Top500]]></category>
		<category><![CDATA[virtualization]]></category>

		<guid isPermaLink="false">http://www.clusterconnection.com/?p=1569</guid>
		<description><![CDATA[Are clouds a good place to do build HPC Clusters?
The use of virtualization and multi-core processors has made cloud computing an option for many users. The ability to buy cloud time as you need it and not purchase hardware is certainly attractive from a financial standpoint. The concept is not new and has its roots [...]]]></description>
			<content:encoded><![CDATA[<p><em>Are clouds a good place to do build HPC Clusters?</em></p>
<p>The use of virtualization and multi-core processors has made cloud computing an option for many users. The ability to buy <em>cloud</em> time as you need it and not purchase hardware is certainly attractive from a financial standpoint. The concept is not new and has its roots in time shared mainframes and grid computing. One might assume the the vast amount of computing resources in clouds may make them ideal candidates for HPC clustering. Unfortunately, it is not as simple as collecting cores.</p>
<p>One of the issues facing clouds is I/O. Basically, I/O is often not predictable or repeatable. From a storage standpoint read and write times can be fast, but not always fast. In terms of messages between servers, most clouds do not support high performance interconnects and similarly make no guarantees as to latency or bandwidth consistency.  While grids paid attention to certain HPC performance guarantees in terms of I/O, clouds, in order to offer ease of use, have declined such guarantees. Unless a cloud has been specifically designed for HPC, the user cannot expect consistent and/or high performance. There are two papers which discuss this very idea. The first paper looks at <a href="http://www.usenix.org/publications/login/2008-10/openpdfs/walker.pdf">Benchmarking Amazon EC2 for High-performance Scientific Computing</a> and the second paper asks, <a href="http://www.cs.utexas.edu/users/pauldj/pubs/uchpc09.pdf">Can Cloud Computing Reach The TOP500?</a>. Both papers conclude that the cloud is not mature enough for HPC applications.</p>
<p>The limitations of the cloud become more apparent when one looks a little deeper at HPC applications. First, many applications rely on <em>user space</em> communication (i.e. high performance MPI programs transfer data directly from one node to another without using kernel services.) Such a <em>close to the wire</em> operation runs counter to the virtualization model. Secondly, as reported in the first paper (above), the performance of OpenMP applications was reduced by 7-21% when running in the EC2 cloud.</p>
<p>Recently Penguin Computing began offering <a href="http://www.penguincomputing.com/POD/Penguin_On_Demand">POD</a> (Penguin on Demand) for HPC cloud computing. The POD cloud offers both Ethernet and InfiniBand connections between nodes thus providing a dedicated high performance computing environment. This service can be considered a specialized HPC cloud.</p>
<p>There are some other other important issues to consider with cloud computing &#8212; security and reliability. When data leaves your domain over the Internet it is virtually impossible to guarantee 100% security. If your organization can live with this situation, using the cloud may be an option. If on the other hand, you need to keep a tight reign on your data, then you may not want to be injecting it into the cloud. The other issue is reliability. If your day to day operations are based on using a cloud, then a contingency plan is a must. Interruptions in Internet traffic due to congestion or hardware failures can be common in some areas. In addition, the cloud provider may have issues (even go out of business) and thus not meet the service requirements.</p>
<p>I believe the cloud is an interesting model, but it is not a real solution for HPC (in its current form). My issue with clouds is that they are often categorized as &#8220;grid like&#8221; and then are somehow (incorrectly) considered &#8220;HPC like.&#8221; Cloud offers utility computing like grid promised, but has pushed the application layer further away from the hardware. HPC practitioners spend a lot of time making sure the application is as close to the hardware as possible. At this point in time, HPC in the cloud is more of a curiosity than a solution. When examining HPC benchmarks it becomes clear that clouds are not the best means to provide HPC cycles. Whether efforts like POD can meet the HPC users needs in the cloud is still unknown.</p>
<p>To be fair, there are some HPC applications that lend themselves to clouds quite well. (i.e. those that do not require predictable I/O)  <a href="http://folding.stanford.edu/">Folding@home</a> and <a href="http://setiathome.berkeley.edu/">Seti@home</a> are two good examples. These applications could easily run in a cloud (in a sense they do run in the Internet cloud). Keep in mind they have been designed to work in a robust distributed fashion and are not virtualized. Clouds can be enticing and even enabling for some applications, but remember a collection of servers (in the cloud or in a rack) does not a cluster make.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.clusterconnection.com/2009/10/clustering-in-the-cloud/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Support, Why Do I Need Cluster Support?</title>
		<link>http://www.clusterconnection.com/2009/10/support-why-do-i-need-cluster-support/</link>
		<comments>http://www.clusterconnection.com/2009/10/support-why-do-i-need-cluster-support/#comments</comments>
		<pubDate>Thu, 08 Oct 2009 22:45:30 +0000</pubDate>
		<dc:creator>Douglas Eadline</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Featured]]></category>
		<category><![CDATA[Beowulf Mailing List]]></category>
		<category><![CDATA[bug fixes]]></category>
		<category><![CDATA[dependencies]]></category>
		<category><![CDATA[ICR]]></category>
		<category><![CDATA[support]]></category>
		<category><![CDATA[updates]]></category>

		<guid isPermaLink="false">http://www.clusterconnection.com/?p=1536</guid>
		<description><![CDATA[Supporting a successful HPC cluster takes time and money, take your pick
Many of the HPC people I know are what you would call rugged individualists. The have been around since the beginning and were responsible for moving the market/community along when commodity HPC was less than fashionable. This group consists mostly of developers, implementers, and [...]]]></description>
			<content:encoded><![CDATA[<p><em>Supporting a successful HPC cluster takes time and money, take your pick</em></p>
<p>Many of the HPC people I know are what you would call <em>rugged individualists</em>. The have been around since the beginning and were responsible for moving the market/community along when commodity HPC was less than fashionable. This group consists mostly of developers, implementers, and administrators. Many of these people developed, by way of discussion on the <a href="http://www.beowulf.org/mailman/listinfo/beowulf">Beowulf Mailing List</a>, the best practices used today. The Beowulf Mailing list is a true resource if there ever was one. A <em>newbie</em> can ask a question and get polite (and lengthy) answers by list members. The list holds a large amount of open community knowledge because all the HPC plumbing is open source. Unhindered discussions can take place at any level between any number of people.</p>
<p>There is also the false notion that open source software is &#8220;free as in beer.&#8221; This idea is not quite true because software unlike toasters have a usage cost. Once you install, configure, and study any software you have already made an &#8220;investment&#8221; in the package. Continued use furthers this investment. The size of the investment is up to you. And, because the software is open, in theory you can fix any problem. Thus, you have the choice to decide how much time you can invest in a particular software package before it becomes &#8220;expensive&#8221; to you. At some point, the cost (or dilution) of your time may come into play. Spending weeks fine tuning a single application at the expense of other responsibilities is probably not going to play out very well.</p>
<p>In terms of support responsibilities, many clusters have minimal issues once they are configured correctly. There are, of course, hardware failures, but in general, once everything is booted, things often work quite well. There are two areas that need attention, however. The first is software updates. Updates are needed for several reasons including security, bug fix, or feature updates. These types of updates are usually easy to manage unless they have dependencies, which means there may be a whole raft of packages that need updating. If you don&#8217;t get the dependencies right, then there can be problems with the entire cluster.</p>
<p>The other issue is local integration. This is what I consider the &#8220;last mile problem&#8221; for clusters. Very often local file system issues need to be worked out and managed in addition to creating job submission policies. There is usually some end-user assistance needed as well as questions on how to compile and submit jobs to the queue. Of course, &#8220;Why is my job sitting in the queue?&#8221; is probably the question that gets asked the most.</p>
<p>If your job includes time for installing, integrating, and updating software and you happen to be one of those rugged individualist HPC people, then you probably have no interest in professional support. If on the other hand, you are new to clustering (or Linux) and already have many responsibilities, then you may want to consider using professional support services. As with all open source software, the choice is yours. In terms of commercial software, or commercial support of open software, there are many options. In any case, purchasing support for business critical applications is always a good idea. As is the use of a <a href="http://software.intel.com/en-us/cluster-ready/">Intel Cluster Ready</a> (ICR) solution. By adhering to the ICR specification support is much easier &#8212; for you and/or a vendor. That is, a reference platform allows you and your vendors to <em>work from the same page</em>. Without a common framework, support vendors and others in your organization may have to decipher/debug how you configured the cluster.</p>
<p>In conclusion, cluster support can be commercial or it can be institutional. In either case, there is a cost. If you do it on your own, it will cost time and if you hire a consultant or company, it will cost money. To supplement either effort, there is a large amount of information on the web that can be useful when identifying and solving problems. Support is an important part of any successful HPC cluster,  just ask the old-timers. They figured it out, so you don&#8217;t have to.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.clusterconnection.com/2009/10/support-why-do-i-need-cluster-support/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Intel Cluster Checker Experience</title>
		<link>http://www.clusterconnection.com/2009/10/the-intel-cluster-checker-experience/</link>
		<comments>http://www.clusterconnection.com/2009/10/the-intel-cluster-checker-experience/#comments</comments>
		<pubDate>Fri, 02 Oct 2009 20:40:51 +0000</pubDate>
		<dc:creator>Thomas Gebert</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Featured]]></category>
		<category><![CDATA[Hardware]]></category>
		<category><![CDATA[ICR]]></category>
		<category><![CDATA[Intel Cluster Checker]]></category>

		<guid isPermaLink="false">http://www.clusterconnection.com/?p=1587</guid>
		<description><![CDATA[Well yes, this is another &#8220;cheers to the Intel Cluster Checker&#8221; blog and there must be reasons out there why people are writing about the Intel Cluster Ready program. Yes, there are reasons&#8230;
I think I do not have to state that setting up an HPC cluster sometimes is an adventure of compiling and installing different [...]]]></description>
			<content:encoded><![CDATA[<p>Well yes, this is another &#8220;cheers to the Intel Cluster Checker&#8221; blog and there must be reasons out there why people are writing about the Intel Cluster Ready program. Yes, there are reasons&#8230;</p>
<p>I think I do not have to state that setting up an HPC cluster sometimes is an adventure of compiling and installing different program versions and libraries. Nevertheless, when having all up and running, the Intel Cluster Checker is a really nice tool to verify that all services needed for your HPC cluster are set up correctly and you haven&#8217;t forgotten one of those tiny details during the installation of your cluster.</p>
<p>My last experience with the Intel Cluster Checker and the newly installed HPC cluster were a bit different from the ones before. The first steps ran smoothly and everything seemed to work fine, with me having learned from my previous first Intel Cluster Checker experience. The tests were successful, I altered the benchmark thresholds and all tests ended with a &#8220;passed&#8221;. But in the end, when checking the dmidecode output I got a &#8220;failed&#8221; on this test. First I thought this was due to some BIOS specific mismatches, which you can exclude. But when I had a closer look on the output file of the Intel Cluster Checker I saw that there seemed to be installed different RAM modules on the compute nodes.  Hmm&#8230;the sizes of the memories looked fine, all were 2GB and I double checked the product numbers of the DIMMs that the Intel Cluster Checker reported. Here we got two different types of memories installed. I was astonished on the one hand as I would not have checked this without the Intel Cluster Checker and on the other hand I hardly could believe that there were two different types of memory DIMMs installed. Well, the servers I had installed were Intel Nehalem based and I remembered the days when AMD started with the CPU built-in memory controller and the problems with memories that arose during those times&#8230;</p>
<p>I took a closer look at the compute nodes and opened the chassis of  those which were affected. And indeed, there were different types of memory modules built in, but they nearly looked the same and also had the same product number written down on them. After some investigation with our purchasing department I found out that those memory types have been mixed up. Unfortunately it was not possible to pin down the real cause for this confusion.</p>
<p>The DIMMs were swapped to the correct ones in the end and I ran the Intel Cluster Checker again. This time all tests were passed and the results were sent to Intel to verify the Cluster Ready Certificate, which now proudly resides beside the other ICRs we have scored.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.clusterconnection.com/2009/10/the-intel-cluster-checker-experience/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Skinny on Solid State Disks</title>
		<link>http://www.clusterconnection.com/2009/09/the-skinny-on-solid-state-disks/</link>
		<comments>http://www.clusterconnection.com/2009/09/the-skinny-on-solid-state-disks/#comments</comments>
		<pubDate>Tue, 22 Sep 2009 22:54:20 +0000</pubDate>
		<dc:creator>Douglas Eadline</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Featured]]></category>
		<category><![CDATA[IOPS]]></category>
		<category><![CDATA[JEDEC]]></category>
		<category><![CDATA[NAND flash]]></category>
		<category><![CDATA[OLTP]]></category>
		<category><![CDATA[SSD]]></category>
		<category><![CDATA[Storage]]></category>

		<guid isPermaLink="false">http://www.clusterconnection.com/?p=1475</guid>
		<description><![CDATA[What you need to know about the latest trend in storage hardware
The Solid State Drive (SSD) has had a break-out year. Unlike traditional mechanical hard disk drives that use spinning platters with movable read/write heads, SSDs have no moving parts.  The SSDs is made entirely out of a special type of flash memory &#8212; [...]]]></description>
			<content:encoded><![CDATA[<p><em>What you need to know about the latest trend in storage hardware</em></p>
<p>The Solid State Drive (SSD) has had a break-out year. Unlike traditional mechanical hard disk drives that use spinning platters with movable read/write heads, SSDs have no moving parts.  The SSDs is made entirely out of a special type of flash memory &#8212; the same kind of NAND flash memory found in thumb-drives and memory sticks. Overall, SSDs are faster, quieter, more energy efficient, but less dense than the traditional spinning platter drive.</p>
<p>The staple of the storage industry has been the mechanical drives, where I/O rates are limited to the mechanical properties of a drive. Unlike other semiconductor trends, the sustained write rate has barely doubled from 50 to 90 MB/second over the past 8-10 years. All this is about to change as the use of flash memory will allow stroage to take advantage of a semiconductor growth curve similar to that of processors and memory. (i.e. mechanical drives are limited by the physical motion of spinning disks).</p>
<p>Perhaps the most important feature offered by SSDs is the read and write performance. The IOPS (I/Os per second) rate for an SSD is usually two to five times that of a traditional mechanical hard drive. When reading, performance is mostly constant because the seek time is virtually instantaneous and does not depend on the physical location of the data on a platter. As a result, file fragmentation has almost no impact on read performance. In addition, because there are no moving parts, SSDs use as little as one-fifth the power of a mechanical drive. Another interesting feature is the SSD failure mode. Most SSD failures tend to happen when writing. In contrast, mechanical drives tend to have most failures when reading. Thus, once data is written, it is more likely it can be read from a failed SSD.</p>
<p>SSDs do suffer from “degradation” over time that results in reduced performance and limited lifetimes (i.e. there are a limited amount of read/write cycles avaliable for NAND memory). Vendors have taken this into account and include <em>wear leveling</em> algorithms in SSDs that spread write access evenly over the entire device.</p>
<p>If you are interested in the exploring SSDs, there are some key points to consider. First, SSDs are not the best solution in every case. Currently, their capacity is much less than that of traditional mechanical drives and as such may not be suitable for some of the large HPC data sets. In addition, the cost is per MB is higher and are more susceptible to data loss from energy and power surges.</p>
<p>Second, the very fast read times offered by SSDs has made them a good candidate for improving OLTP (Online Transaction Processing) systems where frequently read tables and indexes can be accelerated. Check both the read and write IOPS, as there is usually a big difference between these values. Typically, the read speed is 10 times the write speed resulting in asymmetric performance. In terms of clusters, using SSDs for NFS mounts or read-only data may be helpful.</p>
<p>Third, because SSDs are new, questions still remain about how much of that speed they can deliver for the long haul (due to degradation). Typically, an SSD will show an initial decrease in performance and then level off. Even with a performance drop over time, SSD drives are almost always faster than traditional hard drives. The JEDEC standards organization plans to publish two standards by the end of this year for SSD endurance metrics.</p>
<p>Finally, pay particular attention to “write endurance,” this number should pertain to random writes. For instance, an Intel® X25-E Extreme 64 GB SATA Solid-State Drive is rated for 2 petabytes of lifetime random writes.</p>
<p>In terms of software, one difficulty facing the industry are the legacy assumptions built into file systems. These assumptions will need to be challenged in order to take advantage of SSD technology. For instance user applications and file systems will need to account for the asymmetric read/write performance of SSDs. Many computer applications rely on synchronous patterns of read/write operations, wherein a given write or update must be completed and the write confirmed before additional application read requests can be issued. With SSDs this process may need to be reconsidered.</p>
<p>There is no doubt that SSDs are the future of storage. Indeed, SSDs are even changing the way we compute. For example, CAE (Computer Aided Engineering) applications can use the speed advantage of SSDs in their out-of-core algorithms. The power of semiconductor manufacturing technology combined the speed of NAND flash memory are about to make the storage market stand still!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.clusterconnection.com/2009/09/the-skinny-on-solid-state-disks/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Future File Systems: Btrfs and ZFS</title>
		<link>http://www.clusterconnection.com/2009/09/future-file-systems-btrfs-and-zfs/</link>
		<comments>http://www.clusterconnection.com/2009/09/future-file-systems-btrfs-and-zfs/#comments</comments>
		<pubDate>Sat, 19 Sep 2009 02:51:34 +0000</pubDate>
		<dc:creator>Douglas Eadline</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Featured]]></category>
		<category><![CDATA[Btrfs]]></category>
		<category><![CDATA[File Systems]]></category>
		<category><![CDATA[linux]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[ZFS]]></category>

		<guid isPermaLink="false">http://www.clusterconnection.com/?p=1421</guid>
		<description><![CDATA[The prediction is in. What file system will move us into the future?
The thirst for better file system technology is not new to the Unix/Linux world. There is a rich history of trying to optimize the balance between the storage system and the process improvements in a demanding user environment. The efforts to build a [...]]]></description>
			<content:encoded><![CDATA[<p><em>The prediction is in. What file system will move us into the future?</em></p>
<p>The thirst for better file system technology is not new to the Unix/Linux world. There is a rich history of trying to optimize the balance between the storage system and the process improvements in a demanding user environment. The efforts to build a better file system are quite numerous and are built on the efforts of many people. For example Kirk McKusicks’ original Berkeley Fast File System improved on the original V7 release. Steven Tweedie’s ext3 took ideas from the database logs and Margo Seltzer’s LFS and added them to Linux’s implementation of UFS &#8212; Ted Tsao’s ext2.  In the mean time, DEC released Megasafe, SGI released XFS and Sun released ZFS all to the wild. And now Oracle has developed Btrfs for Linux.</p>
<p>So why do we as users care?  There better be a good reason to change a file system because a new file system usually means converting to and trusting a new format with your data. Thus, any new format or change must provide a compelling reason or solve a big problem. Otherwise what is &#8220;good enough and works&#8221; is often better than that which is &#8220;new and fancy.&#8221;</p>
<p>If you follow the details of file systems development, then this quick update may not be of interest to you. For the rest of us, who just choose whatever file system the installer offers, you may want to read further because changes are afoot.</p>
<p>If you are like me, you probably are running Linux with the ext3 file system. There is nothing wrong with ext3 as it is stable, robust, and a standard Linux file systems. And, one other thing, it is old. Even if you are running the newer ext4, you are still running a 30-year old file format that is more than a little short on features.</p>
<p>There are those that believe ext4 is going to be the end of the line and a switch over to Btrfs is very likely. Btrfs (pronounced &#8220;butter-F-S&#8221;) is being developed by Chris Mason at Oracle. It is an open source project that has recently been added to the Linux kernel (as of 2.6.29) as experimental code.</p>
<p>Btrfs is based on several new ideas including b-trees (binary trees, which is where the <em>btr</em> comes from in Btrfs) and &#8220;copy-on-write&#8221; or COW. While, I won&#8217;t go into the details, b-trees and COW allow for some new features that would be difficult in the ext* line of file systems. (If you want to learn more about the technical details of Btrfs, see  <a href="http://lwn.net/Articles/342892/">A short history of btrfs</a> on lwn.net.) Some of the new features include file-system snapshots, check-summing, online defragmentation, compression, extents, resizing, and more. In particular, Btrfs allows one thing that has been difficult to achieve in the past &#8212; optimizing both access time and disk space.</p>
<p>The fact that Oracle sponsors Btrfs has lead to some concern. Recently, Oracle purchased Sun Microsystems which has been developing the ZFS file system for many years. ZFS is similar to Btrfs (it uses COW) and provides many of the same features, but it is very different in its internal implementation. ZFS will also &#8220;run&#8221; under Linux using <a href="http://zfs-on-fuse.blogspot.com/">Fuse</a>. Mason and other have assured the community that Btrfs is important to Oracle and they will continue development. In addition, the open source nature of Btrfs ensures that it cannot be &#8220;taken away&#8221; now or in the future.</p>
<p>There is plenty more to consider and I suggest reading <a href="http://www.linux-mag.com/id/7308/">Linux Don&#8217;t Need No Stinkin&#8217; ZFS: BTRFS Intro &amp; Benchmarks</a> by my friend Jeff Layton. The consensus seems to be that Btrfs is destined to become the default Linux file systems within two years. ZFS on the other hand must overcome some licensing issues before it can even make it into the Linux kernel for testing. Your next Linux install may offer a new and better (or &#8220;btr&#8221;) file system than in the past.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.clusterconnection.com/2009/09/future-file-systems-btrfs-and-zfs/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
