<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>NOVATTE &#124; High Performance Computing Blog</title>
	<atom:link href="http://novatte.com/blog/feed/" rel="self" type="application/rss+xml" />
	<link>http://novatte.com/blog</link>
	<description></description>
	<lastBuildDate>Wed, 14 Mar 2012 04:56:15 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Benefits of GPU computing systems comparing to CPU-based systems</title>
		<link>http://novatte.com/blog/2012/03/benefits-of-gpu-computing-systems-comparing-to-cpu-based-systems/</link>
		<comments>http://novatte.com/blog/2012/03/benefits-of-gpu-computing-systems-comparing-to-cpu-based-systems/#comments</comments>
		<pubDate>Wed, 14 Mar 2012 04:54:39 +0000</pubDate>
		<dc:creator>NOVATTE</dc:creator>
				<category><![CDATA[HPC]]></category>
		<category><![CDATA[GPU Computing]]></category>

		<guid isPermaLink="false">http://novatte.com/blog/?p=104</guid>
		<description><![CDATA[Take a look at this nice video which explains advantages of GPU computing systems VS systems based on CPUs only. MythBusters hosts nailed it. This video is a Great start in the World of GPU Computing!]]></description>
			<content:encoded><![CDATA[<p>Take a look at this nice video which explains advantages of GPU computing systems VS systems based on CPUs only. MythBusters hosts nailed it. This video is a Great start in the World of GPU Computing!</p>
<p><span style="text-align:center; display: block;"><a href="http://novatte.com/blog/2012/03/benefits-of-gpu-computing-systems-comparing-to-cpu-based-systems/"><img src="http://img.youtube.com/vi/taC5OHY-Ck0/2.jpg" alt="" /></a></span></p>
]]></content:encoded>
			<wfw:commentRss>http://novatte.com/blog/2012/03/benefits-of-gpu-computing-systems-comparing-to-cpu-based-systems/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>5 reasons to consider switching to Intel E5-2600 series CPUs starting from yesterday</title>
		<link>http://novatte.com/blog/2012/03/5-reasons-to-consider-switching-to-intel-e5-2600-series-cpus-starting-from-yesterday/</link>
		<comments>http://novatte.com/blog/2012/03/5-reasons-to-consider-switching-to-intel-e5-2600-series-cpus-starting-from-yesterday/#comments</comments>
		<pubDate>Fri, 09 Mar 2012 08:08:47 +0000</pubDate>
		<dc:creator>NOVATTE</dc:creator>
				<category><![CDATA[HPC]]></category>

		<guid isPermaLink="false">http://novatte.com/blog/?p=79</guid>
		<description><![CDATA[On Mar 6, 2012 Intel announced new sever CPUs for dual-CPU platform &#8211; Sandy Bridge E5-2600-series in which by adding just 2 additional cores it made E5-2600 almost 2 times faster than the previous X5600 generation! How did they do it and why one should consider switching to E5-based system? Architecture difference between X5600 series and E5-2600 series Reason 1: &#8230; <a href="http://novatte.com/blog/2012/03/5-reasons-to-consider-switching-to-intel-e5-2600-series-cpus-starting-from-yesterday/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>On Mar 6, 2012 Intel announced new sever CPUs for dual-CPU platform &#8211; Sandy Bridge E5-2600-series in which by adding just 2 additional cores it made E5-2600 almost 2 times faster than the previous X5600 generation! How did they do it and why one should consider switching to E5-based system?</p>
<p><em><strong>Architecture difference between X5600 series and E5-2600 series</strong><br />
</em>
<a href="http://novatte.com/blog/wp-content/gallery/article-images/x5600vse5-2600.jpg" title="" class="shutterset_singlepic3" >
	<img class="ngg-singlepic" src="http://novatte.com/blog/wp-content/gallery/cache/3__996x280_x5600vse5-2600.jpg" alt="x5600vse5-2600" title="x5600vse5-2600" />
</a>
</p>
<p><strong>Reason 1: increased Bandwidth.</strong> The second Quick Path Interconnect (QPI) link was added to the CPU and the speed of QPI itself was increased from 6.4 Giga Transfers per second (GT/s) for X5600 series to 8GT/s for E5-2600 series resulting in huge performance increase through increased intra-CPU communication speed.</p>
<p><strong>Reason 2: more CPU cores.</strong> The number of cores increased from 6 to 8, and even though the cores became smaller, the number of instructions per cycle increased from 4 (X5600 series) to 8 (E5-2600 series) in AVX resulting almost <a href="http://novatte.com/blog/2012/03/how-to-calculate-theoretical-peak-performance-of-a-cpu-based-hpc-system/">100% performance increase for floating-point operations</a>!</p>
<p><strong>Reason 3: faster memory speed and increase RAM capacity.</strong> The number of memory channels increased from 3 to 4 per CPU plus Sandy Bidge can now support DDR3-1600 memory of up to 32GB per module (comparing to DDR3-1333 maximum memory for X5600 series) providing a maximum memory capacity of <a href="http://novatte.com/cloud-computing-servers/cloudbee-c2100pr">768GB RAM per dual-CPU system</a>.</p>
<p><em>To summarize the differences between X5600 series and E5-2600 series platforms:</em></p>
<table width="438" border="1" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td valign="top" width="142"></td>
<td valign="top" width="142">
<p style="text-align: center;"><strong><em>X5600 series<br />
platform</em></strong></p>
</td>
<td valign="top" width="154">
<p style="text-align: center;"><strong><em>E5-2600 series platform</em></strong></p>
</td>
</tr>
<tr>
<td valign="top" width="142">Processor</td>
<td valign="top" width="142">One QPI 6.4GT/s link<br />
6 cores / 12 threads / 12MB cache<br />
Turbo 1.0</td>
<td valign="top" width="154">Two QPI 8.0 GT/s links<br />
8 cores / 16 threads / 20MB cache<br />
Turbo 2.0<br />
AVX</td>
</tr>
<tr>
<td valign="top" width="142">Memory</td>
<td valign="top" width="142">3 channels<br />
Up to 1333 MHz<br />
Up to 18 DIMMs<br />
Up to 288GB per system</td>
<td valign="top" width="154">4 channels<br />
Up to 1600MHz<br />
Up to 24 DIMMs<br />
Up to 768GB per system<br />
LRDIMMs</td>
</tr>
<tr>
<td valign="top" width="142">I/O</td>
<td valign="top" width="142">Two-chipIOH/ICHUp to 32 lanes of PCI Express 2.0</td>
<td valign="top" width="154">Integrated I/OUp to 80 lanes of PCI Express 3.0DDIO</td>
</tr>
<tr>
<td valign="top" width="142">Power Management</td>
<td valign="top" width="142">NM 1.5</td>
<td valign="top" width="154">NM 2.0</td>
</tr>
</tbody>
</table>
<p>&nbsp;</p>
<p><strong> Reason 4: x2 performance increase</strong></p>
<p><em>Server CPU Theoretical Peak Performance comparison:</em></p>
<table width="420" border="1" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td valign="top" width="57"></td>
<td colspan="2" valign="top" width="201">
<p style="text-align: center;"><strong><em>X5600 series CPUs</em></strong></p>
</td>
<td colspan="2" valign="top" width="163">
<p style="text-align: center;"><strong><em>E5-2600 series CPUs</em></strong></p>
</td>
</tr>
<tr>
<td valign="top" width="57"></td>
<td valign="top" width="78">
<p style="text-align: center;">Part Number</p>
</td>
<td style="text-align: center;" valign="top" width="123">Performance, GFLOPS</td>
<td style="text-align: center;" valign="top" width="84">Part Number</td>
<td valign="top" width="80">
<p style="text-align: center;">Performance, GFLOPS<strong></strong></p>
</td>
</tr>
<tr>
<td valign="top" width="57">Top BinLow bin</td>
<td valign="top" width="78">X5690<br />
X5680<br />
X5675<br />
X5670<br />
X5660<br />
X5650</td>
<td valign="top" width="123">83.04<br />
79.92<br />
73.44<br />
70.32<br />
67.2<br />
63.84</td>
<td valign="top" width="84">E5-2690<br />
E5-2680<br />
E5-2670<br />
E5-2665<br />
E5-2660<br />
E5-2650</td>
<td valign="top" width="80">185.6<br />
172.8<br />
166.4<br />
153.6<br />
140.8<br />
128</td>
</tr>
</tbody>
</table>
<p>&nbsp;</p>
<p><strong>Reason 5: great price / performance</strong></p>
<p><em>Price-performance comparison</em></p>
<table border="1" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td valign="top" width="142">
<p style="text-align: center;"><strong>X5600 series VS E5-2600 series</strong></p>
</td>
<td valign="top" width="142">
<p style="text-align: center;"><strong>Price<br />
</strong><strong>Difference, Times</strong></p>
</td>
<td style="text-align: center;" valign="top" width="142"><strong>Performance, Difference, Times</strong></td>
</tr>
<tr>
<td style="text-align: center;" valign="top" width="142">X5690 / E5-2690<br />
X5680 / E5-2680<br />
X5675 / E5-2670<br />
X5670 / E5-2665<br />
X5660 / E5-2660<br />
X5650 / E5-2650</td>
<td style="text-align: center;" valign="top" width="142">1.25<br />
1.05<br />
1.09<br />
1.01<br />
1.10<br />
1.12</td>
<td valign="top" width="142">
<p style="text-align: center;">2.24<br />
2.16<br />
2.27<br />
2.18<br />
2.10<br />
2.01</p>
</td>
</tr>
</tbody>
</table>
<p>In other words for around 10% increase in price one gets more than 100% increase in performance which in tern means that one will need to buy twice as less servers to complete the same task!</p>
]]></content:encoded>
			<wfw:commentRss>http://novatte.com/blog/2012/03/5-reasons-to-consider-switching-to-intel-e5-2600-series-cpus-starting-from-yesterday/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How to calculate theoretical peak performance of a CPU-based HPC system</title>
		<link>http://novatte.com/blog/2012/03/how-to-calculate-theoretical-peak-performance-of-a-cpu-based-hpc-system/</link>
		<comments>http://novatte.com/blog/2012/03/how-to-calculate-theoretical-peak-performance-of-a-cpu-based-hpc-system/#comments</comments>
		<pubDate>Thu, 08 Mar 2012 07:54:20 +0000</pubDate>
		<dc:creator>NOVATTE</dc:creator>
				<category><![CDATA[HPC]]></category>

		<guid isPermaLink="false">http://novatte.com/blog/?p=72</guid>
		<description><![CDATA[To calculate theoretical peak performance of the HPC system we first need to calculate theoretical peak performance of one node (server) in GFLOPS and than just to multiply node performance on the number of nodes your HPC system has. HPC world is using the following formulae for node theoretical peak performance: Node performance in GFLOPS = (CPU speed in GHz) x &#8230; <a href="http://novatte.com/blog/2012/03/how-to-calculate-theoretical-peak-performance-of-a-cpu-based-hpc-system/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>To calculate theoretical peak performance of the HPC system we first need to calculate theoretical peak performance of one node (server) in GFLOPS and than just to multiply node performance on the number of nodes your HPC system has. HPC world is using the following formulae for node theoretical peak performance:</p>
<p><em><strong>Node performance in GFLOPS = (CPU speed in GHz) x (number of CPU cores) x (CPU instruction per cycle) x (number of CPUs per node).</strong></em></p>
<p><em>Note:<br />
</em><em>-       Intel X5600 series CPUs and AMD 6100/6200 series CPUs have 4 instructions per cycle<br />
</em><em>-       Intel E5-2600 series have 8 instructions per cycle</em></p>
<p>Example 1: Dual-CPU server based on Intel X5675 (3.06GHz 6-cores) CPUs:<br />
3.06 x 6 x 4 x 2 = 144.88 GFLOPS</p>
<p>Example 2: Dual-CPU server based on Intel E5-2670 (2.6GHz 8-cores) CPUs:<br />
2.6 x 8 x 8 x 2 = 332.8 GFLOPS<br />
<em>(Note that number instructions per cycle for E5-2600 series CPUs is equal to 8 )</em></p>
<p>Example 3: Dual-CPU server based on AMD 6176 (2.3GHz 12-cores) CPUs:<br />
2.3 x 12 x 4 x 2 = 220.8 GFLOPS</p>
<p>Example 4: Dual-CPU server based on AMD 6274 (2.2GHz 16-cores) CPUs:<br />
2.2 x 16 x 4 x 2 = 281.6 GFLOPS</p>
<p>&nbsp;</p>
<p>And now, after we know that our node performance – we just multiply that number to the number of nodes we have in our system to get the<strong> </strong>theoretical peak performance of a CPU-based HPC system (we’ll use the node from Example 2 above to calculate HPC system performance consisting of 8 nodes):<br />
332.8 GFLOPS x 8 = 2,442.4 GFLOPS = 2.44 TFLOPS</p>
]]></content:encoded>
			<wfw:commentRss>http://novatte.com/blog/2012/03/how-to-calculate-theoretical-peak-performance-of-a-cpu-based-hpc-system/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Some words about Efficiency</title>
		<link>http://novatte.com/blog/2011/11/some-words-about-efficiency/</link>
		<comments>http://novatte.com/blog/2011/11/some-words-about-efficiency/#comments</comments>
		<pubDate>Thu, 10 Nov 2011 23:18:28 +0000</pubDate>
		<dc:creator>oxana</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[GPU Computing]]></category>
		<category><![CDATA[New Installations]]></category>

		<guid isPermaLink="false">http://itmo.asia/blog/?p=6</guid>
		<description><![CDATA[It has been a busy and productive period for NOVATTE. Our team just finished deployment of powerful GPU cluster for one of the Singapore Government agencies. The cluster built on new NVIDIA Tesla S2050 and NOVATTE Twin servers has the peak theoretical performance of 16 TFLOPS. The actual performance measured by the LINPACK Benchmark is 12.45 TFLOPS, providing efficiency coefficient &#8230; <a href="http://novatte.com/blog/2011/11/some-words-about-efficiency/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>It has been a busy and productive period for NOVATTE. Our team just finished deployment of powerful GPU cluster for one of the Singapore Government agencies. The cluster built on new NVIDIA Tesla S2050 and NOVATTE Twin servers has the peak theoretical performance of 16 TFLOPS. The actual performance measured by the LINPACK Benchmark is 12.45 TFLOPS, providing efficiency coefficient of 77.81%. Great result for a cluster based on open source software. To compare, the second largest cluster in the world accelerated by NVIDIA Tesla C2050 GPUs has efficiency of 42.58% (<a>http://www.top500.org/list/2010/06/100</a>). Now NOVATTE HPC cluster is to facilitate efficiency of the customers, saving their time and resources.</p>
]]></content:encoded>
			<wfw:commentRss>http://novatte.com/blog/2011/11/some-words-about-efficiency/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Welcome</title>
		<link>http://novatte.com/blog/2011/10/welcome/</link>
		<comments>http://novatte.com/blog/2011/10/welcome/#comments</comments>
		<pubDate>Mon, 10 Oct 2011 23:18:05 +0000</pubDate>
		<dc:creator>oxana</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://itmo.asia/blog/?p=4</guid>
		<description><![CDATA[Hi There! We are glad you stopped by. We are in the process of creating blog for NOVATTE. This blog will focus on issues, solutions and trends in high-performance computing (HPC) in Asia. Come back soon to exchange ideas on the topic, get insights and useful technical tips on different aspects of supercomputing.]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft size-full wp-image-52" title="stock-photo-17516337-welcome" src="http://novatte.com/blog/wp-content/uploads/2011/11/stock-photo-17516337-welcome.jpg" alt="" width="110" height="73" />Hi There! We are glad you stopped by. We are in the process of creating blog for NOVATTE. This blog will focus on issues, solutions and trends in high-performance computing (HPC) in Asia.</p>
<p>Come back soon to exchange ideas on the topic, get insights and useful technical tips on different aspects of supercomputing.</p>
]]></content:encoded>
			<wfw:commentRss>http://novatte.com/blog/2011/10/welcome/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Served from: novatte.com @ 2012-05-19 08:15:56 -->
