<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Petapath Blog &#187; HPC</title>
	<atom:link href="http://www.petapath.com/blog/category/hpc/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.petapath.com/blog</link>
	<description>Musings on HPC and heterogeneous systems</description>
	<lastBuildDate>Tue, 15 Jun 2010 13:49:35 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1</generator>
		<item>
		<title>OpenCL 1.1 Specification Released</title>
		<link>http://www.petapath.com/blog/2010/06/15/opencl-1-1-specification-released/</link>
		<comments>http://www.petapath.com/blog/2010/06/15/opencl-1-1-specification-released/#comments</comments>
		<pubDate>Tue, 15 Jun 2010 13:49:35 +0000</pubDate>
		<dc:creator>Dairsie</dc:creator>
				<category><![CDATA[Heterogeneous]]></category>
		<category><![CDATA[HPC]]></category>
		<category><![CDATA[Khronos]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[OpenCL]]></category>
		<category><![CDATA[Developer]]></category>
		<category><![CDATA[Petapath]]></category>

		<guid isPermaLink="false">http://www.petapath.com/blog/?p=191</guid>
		<description><![CDATA[OpenCL 1.1 adds significant functionality including: New data types including 3-component vectors and additional image formats; Handling commands from multiple hosts and processing buffers across multiple devices; Operations on regions of a buffer including read, write and copy of 1D, 2D or 3D rectangular regions; Enhanced use of events to drive and control command execution; [...]]]></description>
			<content:encoded><![CDATA[<div>OpenCL 1.1 adds significant functionality including:</div>
<ul>
<li>New data types including 3-component vectors and additional image formats;</li>
<li>Handling commands from multiple hosts and processing buffers across multiple devices;</li>
<li>Operations on regions of a buffer including read, write and copy of 1D, 2D or 3D rectangular regions;</li>
<li>Enhanced use of events to drive and control command execution;</li>
<li>Additional OpenCL C built-in functions such as integer clamp, shuffle and asynchronous strided copies;</li>
<li>Improved OpenGL interoperability through efficient sharing of images and buffers by linking OpenCL and OpenGL events.</li>
</ul>
<p><span style="color: #000000;">Full Press Release is <a href="http://r20.rs6.net/tn.jsp?et=1103480009991&amp;s=11073&amp;e=001I4cetm99eioFeuPQ46YN2G-5Twx4rjoNRL8_1tSDD2yhtrIEnZ3VN83XyUPKqEvUautNg4upO_g4Wkz7P6QxTl5fpmIlXDjdhUFt7sblW75gn5wvi-Z9uym04jyWTNmbnaUC2kXzfP3LCIQk8twq9R1e2FcI8xCbs8gJPnOBi7iJFJrGNV1etvQt83z3Aayb6yt1p3HzJrZVC4Ptqs-SxGj8cdqtCmhVoAFxXBDZEH8=" target="_blank">available here</a>.</span></p>
<p><span style="color: #000000;">If you are an NVIDIA registered developer you can download their OpenCL 1.1 Conformance Candidate, and AMD will include OpenCL 1.1 support in their next Stream SDK release.</span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.petapath.com/blog/2010/06/15/opencl-1-1-specification-released/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>GPU Technology Conference 2010</title>
		<link>http://www.petapath.com/blog/2010/03/26/gpu-technology-conference-2010/</link>
		<comments>http://www.petapath.com/blog/2010/03/26/gpu-technology-conference-2010/#comments</comments>
		<pubDate>Fri, 26 Mar 2010 13:14:30 +0000</pubDate>
		<dc:creator>Dairsie</dc:creator>
				<category><![CDATA[CUDA]]></category>
		<category><![CDATA[gpgpu]]></category>
		<category><![CDATA[Heterogeneous]]></category>
		<category><![CDATA[HPC]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[NVIDIA]]></category>
		<category><![CDATA[GPU Technology Conference]]></category>

		<guid isPermaLink="false">http://www.petapath.com/blog/?p=184</guid>
		<description><![CDATA[Having attended the 2009 GPU Technology Conference it will be very interesting to see how much things have moved on in the intervening twelve months. For heterogeneous computing to really show its commercial potential, practical results need to be delivered on the back of the marketing impetus delivered by the unveiling of Fermi last year. [...]]]></description>
			<content:encoded><![CDATA[<p>Having attended the 2009 GPU Technology Conference it will be very interesting to see how much things have moved on in the intervening twelve months. For heterogeneous computing to really show its commercial potential, practical results need to be delivered on the back of the marketing impetus delivered by the unveiling of Fermi last year. With Fermi-based parts actually being available in the flesh this time round, I expect there to be quite a lot of interesting announcements.</p>
<p>For the full press release see <a title="GPU Technology Conference 2010 Press Release" href="http://www.nvidia.com/object/io_1269574709099.html" target="_blank">here</a> and the landing page is <a title="GPU Technology Conference 2010" href="http://www.nvidia.com/object/gpu_technology_conference.html" target="_blank">here</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.petapath.com/blog/2010/03/26/gpu-technology-conference-2010/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>OpenCL Tutorial at Hot Chips 21</title>
		<link>http://www.petapath.com/blog/2009/08/21/opencl-tutorial-at-hot-chips-21/</link>
		<comments>http://www.petapath.com/blog/2009/08/21/opencl-tutorial-at-hot-chips-21/#comments</comments>
		<pubDate>Fri, 21 Aug 2009 10:58:04 +0000</pubDate>
		<dc:creator>Dairsie</dc:creator>
				<category><![CDATA[AMD]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[Heterogeneous]]></category>
		<category><![CDATA[HPC]]></category>
		<category><![CDATA[Khronos]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[OpenCL]]></category>

		<guid isPermaLink="false">http://www.petapath.com/blog/?p=145</guid>
		<description><![CDATA[The Khronos Group are presenting an OpenCL Tutorial Session at Hot Chips 21 this Sunday. Presenters include Neil Trevett (Khronos President) and Affie Munshi (OpenCL Specification Editor), along with speakers from AMD, Intel, NVIDIA, Nokia and EA.]]></description>
			<content:encoded><![CDATA[<p>The Khronos Group are presenting an <a title="OpenCL Tutorial Session" href="http://www.hotchips.org/hc21/program/tutorials.htm" target="_blank">OpenCL Tutorial Session</a> at <a title="Hot Chips 21 Main Page" href="http://www.hotchips.org/hc21/main_page.htm" target="_blank">Hot Chips 21</a> this Sunday. Presenters include Neil Trevett (Khronos President) and Affie Munshi (OpenCL Specification Editor), along with speakers from AMD, Intel, NVIDIA, Nokia and EA.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.petapath.com/blog/2009/08/21/opencl-tutorial-at-hot-chips-21/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Rapidmind gets gobbled</title>
		<link>http://www.petapath.com/blog/2009/08/20/rapidmind-gets-gobbled/</link>
		<comments>http://www.petapath.com/blog/2009/08/20/rapidmind-gets-gobbled/#comments</comments>
		<pubDate>Thu, 20 Aug 2009 13:28:00 +0000</pubDate>
		<dc:creator>Dairsie</dc:creator>
				<category><![CDATA[Compilers]]></category>
		<category><![CDATA[gpgpu]]></category>
		<category><![CDATA[Heterogeneous]]></category>
		<category><![CDATA[HPC]]></category>
		<category><![CDATA[Intel]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Rapidmind]]></category>

		<guid isPermaLink="false">http://www.petapath.com/blog/?p=138</guid>
		<description><![CDATA[And the breaking news is that Rapidmind has been acquired by Intel.]]></description>
			<content:encoded><![CDATA[<p>And the breaking news is that <a title="Rapidmind Intel Announcement" href="http://www.rapidmind.com/company.php" target="_blank">Rapidmind</a> has been acquired by <a title="Intel + Rapidmind" href="http://software.intel.com/en-us/blogs/2009/08/19/rapidmind-intel/" target="_blank">Intel</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.petapath.com/blog/2009/08/20/rapidmind-gets-gobbled/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>JPR Whitepaper on multi-AIB systems</title>
		<link>http://www.petapath.com/blog/2009/08/04/jpr-whitepaper-on-multi-aib-systems/</link>
		<comments>http://www.petapath.com/blog/2009/08/04/jpr-whitepaper-on-multi-aib-systems/#comments</comments>
		<pubDate>Tue, 04 Aug 2009 20:22:40 +0000</pubDate>
		<dc:creator>Dairsie</dc:creator>
				<category><![CDATA[AMD]]></category>
		<category><![CDATA[DirectX11]]></category>
		<category><![CDATA[gpgpu]]></category>
		<category><![CDATA[Heterogeneous]]></category>
		<category><![CDATA[HPC]]></category>
		<category><![CDATA[Links]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[NVIDIA]]></category>
		<category><![CDATA[OpenCL]]></category>
		<category><![CDATA[Views]]></category>
		<category><![CDATA[Khronos]]></category>

		<guid isPermaLink="false">http://www.petapath.com/blog/?p=110</guid>
		<description><![CDATA[The recent whitepaper from Jon Peddie on Multi GPU issues and opportunities is an interesting read. It&#8217;s a pretty even-handed analysis, as you would expect from someone of Jon&#8217;s experience. However I do wonder about the way in which it is being reported. I&#8217;ve noticed this whitepaper picked up in various places today, with very [...]]]></description>
			<content:encoded><![CDATA[<p>The recent whitepaper from Jon Peddie on <a title="JPR GPU Report" href="http://www.jonpeddie.com/special/WhitePapers/Multi-GPU-issues-and-opportunities.pdf" target="_blank"><em>Multi GPU issues and opportunities</em></a> is an interesting read. It&#8217;s a pretty even-handed analysis, as you would expect from someone of Jon&#8217;s experience. However I do wonder about the way in which it is being reported. I&#8217;ve noticed this whitepaper picked up in various <a title="JP Report on AIB " href="http://en.expreview.com/2009/08/03/nearly-half-of-pcs-to-be-powered-by-multi-gpu-tech-in-2012.html" target="_blank">places</a> today, with very subtly differing <a href="http://www.vizworld.com/2009/08/new-jon-peddie-report-50-penetration-on-gpgpu/" target="_blank">takes</a>, but the headline that most people are pulling out is that JPR has predicted that in the next three years, nearly half of all PCs will have multiple GPU AIBs (Add In Boards). Unfortunately, while re-broadcasting the whitepaper as news, most commentators haven&#8217;t seen fit to suggest why this might be the case. I see it as an interesting exercise in cause and effect!</p>
<p>Jon spends much more time looking at CAGRs, IHV marketing decks and sales projections than I do, but having read the report I do wonder if he&#8217;s missed a trick or two while staring at his tea leaves. Reading the report it&#8217;s clear that JPR&#8217;s remit was to explore multi-GPU from the perspective of scaling the graphics performance by using multi-AIB and multi-GPU systems. Not too surprising given that the report was at least partially sponsored by LucidLogix and to a lesser extent AMD and NVIDIA.</p>
<p>As an aside, LucidLogix are an interesting entrant into the graphics market, as they are producing an IHV-agnostic chip that potentially allows for multi-GPU scaling to take a very interesting turn indeed. It&#8217;s not clear yet how their product will be greeted by the market (or indeed the IHVs), but if it works as well as they say it does (and I haven&#8217;t seen it in action yet) it has the potential to break the current state of single vendor (and mostly single device variant) multi-GPU systems (SLI vs Crossfire). It also has the ability to give habitual AIB buyers a far longer working life for their previous purchases, as in theory, a motherboard enhanced with a LucidLogix device means you can run your latest GPU in parallel with your previous primary graphics card.</p>
<p>ATI/AMD promised this a while back but I&#8217;m not sure it was ever delivered in any meaningful way (they were promoting it as a potential route for physics acceleration around the time NVIDIA acquired PhysX) and NVIDIA have shown systems where an NVIDIA IGP and an NVIDIA AIB co-exist and the driver selects the most appropriate device for a given workload (Hybrid SLI). With a LucidLogix Hydra device acting as a bridge, the theory is that the user sees something approaching additive scaling (certainly not a given from existing solutions) from a mix of different GPUs and best of all you won&#8217;t necessarily be tied to a single vendor either. Of course there&#8217;s likely to be a very long list of caveats to achieving this multi-GPU nirvana but that&#8217;s another (very interesting) blog entry and I digress.</p>
<p>My immediate thought while reading the JPR report was that the analysis curiously excluded IGPs (Integrated Graphics Processors) and thus I presume, devices coming from AMD and Intel with graphics integrated into the same package as the CPU (e.g. Fusion from AMD). The current trend to integrate graphics on the CPU package is a cost driven evolution from North Bridge IGPs (memory controllers already having moved to the CPU), with both AMD and Intel wanting to leverage their position as &#8216;platform&#8217; vendors, and to offer price reductions to PC OEMs (but coincidentally increase the proportion of the PC&#8217;s BOM they see).</p>
<p>In terms of the number of people who only ever see an integrated graphics solution in action (be it in the chipset or in the future next to the CPU) we&#8217;re probably approaching 50%, given Intel&#8217;s current dominance in the mobile and mid to low cost desktop markets. If AMD&#8217;s Fusion range succeeds it will most likely take market share away from Intel rather than the AIB market. So this suggests one way in which JPR sees the market evolving, but apart from the continuing drive to improve rendering performance he doesn&#8217;t really offer any other market drivers. Should he have?</p>
<p>In recent years what has driven the evolution of the PC and the growth of the GPU vendors has really been games. This isn&#8217;t going to stop any time soon (even with consoles accounting for an increasing proportion of that market). At the same time the move to use COTS (Commercial Off The Shelf) systems in HPC (High Performance Computing) has also been a significant evolutionary driver for x86 as a platform.</p>
<p>So where does that leave this new wave of heterogeneous compute that the GPU vendors are so keen to exploit these days? Speaking as a person who already works with heterogeneous systems (i.e. accelerated using GPUs and other specialist co-processors) to solve engineering and scientific problems for clients, the quest for better performance for ISV (and custom) codes will drive adoption in the personal workstation market, but that still leaves the consumer side of the equation.</p>
<p>This is where being involved in the Khronos Group, and watching the evolution of OpenCL from the inside, has shown me just how wide a reach OpenCL (and other related APIs) could potentially have. It won&#8217;t just affect the software we write; it also has the potential to shape the future direction of PC architecture, which is ultimately bound up with the software that we run on it. This leads me back to my cause and effect observation: Just what would drive 50% of all PC owners by 2012 to have bought at least one additional AIB (or bought a machine that shipped with two AIBs)?</p>
<p>We have already seen that the market wasn&#8217;t quite ready for a single source, vendor specific and cost option as far as physics acceleration was concerned. This is not to say that PhysX has been a failure, but it has not yet achieved the market penetration to really drive sales of NVIDIA AIBs in its own right. AMD have lately played smart, and offered a potential counter to the PhysX marketing bullet point that also addresses the issue of vendor specific solutions, by porting the Intel owned Havok physics engine to OpenCL (an interesting move in itself from a marketing perspective).</p>
<p>All of this leads to more questions than answers at the moment. Will the relatively recent existence of an open, cross platform and most importantly cross vendor programming target in OpenCL feed the growth of a non-game based software ecosystem, that is not just able to take advantage of heterogeneous acceleration, but will actually drive it in some quite remarkable directions? Will OpenCL be able to meet the Microsoft juggernaut (in the form of DirectX11 Compute shaders) head on?</p>
<p>I think these two approaches to tapping the compute horsepower available from GPUs are actually complementary rather than necessarily in direct opposition. DX11 Compute Shaders (and their evolutionary descendants) will undoubtedly enable performance improvements for games and open up a wealth of new options for game developers, but I think the smart money for ISVs wanting to develop other applications for heterogeneous systems will be with OpenCL. When the OpenCL standard matures a little and the IHVs work out some of the current interoperability issues, we will start to see truly heterogeneous software solutions arrive and this, I think, could be really important. A healthy software ecosystem will sustain, though it may not significantly grow, the current AIB market (much as JPR predicts), but a lot still depends on where the gamers end up because at the moment this is what&#8217;s driving the year on year improvement in GPU performance.</p>
<p>As luck would have it Neil Trevett, the Khronos Group President, and incidentally an NVIDIA VP, has an interesting <a title="Tech Report interview with Neil Trevett" href="http://www.techreport.com/articles.x/17321" target="_blank">interview</a> on The TechReport today along these lines.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.petapath.com/blog/2009/08/04/jpr-whitepaper-on-multi-aib-systems/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Siggraph 2009</title>
		<link>http://www.petapath.com/blog/2009/08/03/siggraph-2009/</link>
		<comments>http://www.petapath.com/blog/2009/08/03/siggraph-2009/#comments</comments>
		<pubDate>Mon, 03 Aug 2009 11:14:18 +0000</pubDate>
		<dc:creator>Dairsie</dc:creator>
				<category><![CDATA[gpgpu]]></category>
		<category><![CDATA[Heterogeneous]]></category>
		<category><![CDATA[HPC]]></category>
		<category><![CDATA[News]]></category>

		<guid isPermaLink="false">http://www.petapath.com/blog/?p=107</guid>
		<description><![CDATA[I&#8217;m expecting to see a fairly significant number of announcements relating to heterogeneous computing at Siggraph this year.  While fully accelerated, production quality rendering pipelines running on large scale heterogeneous render farms may not quite be here yet (and there is reason to suspect that will always be a pipedream), there are plenty of places [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m expecting to see a fairly significant number of announcements relating to heterogeneous computing at Siggraph this year. While fully accelerated, production quality rendering pipelines running on large scale heterogeneous render farms may not quite be here yet (and there is reason to suspect that will always be a pipedream), there are plenty of places in the production process which can benefit from the power of these systems. Look to see a lot of work-flow related innovations, designed to make an individual animator or TD more productive. We&#8217;ve already seen particular emphasis on accelerated previews for complex shading/lighting on GPUs, so we&#8217;ll see more of the same, and also accelerated particle systems and physics solvers for complex interactive environments.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.petapath.com/blog/2009/08/03/siggraph-2009/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>PGI Launch Compilers for Heterogeneous Computing</title>
		<link>http://www.petapath.com/blog/2009/07/21/pgi-launch-compilers-for-heterogeneous-computing/</link>
		<comments>http://www.petapath.com/blog/2009/07/21/pgi-launch-compilers-for-heterogeneous-computing/#comments</comments>
		<pubDate>Tue, 21 Jul 2009 11:05:40 +0000</pubDate>
		<dc:creator>Dairsie</dc:creator>
				<category><![CDATA[Compilers]]></category>
		<category><![CDATA[CUDA]]></category>
		<category><![CDATA[Heterogeneous]]></category>
		<category><![CDATA[HPC]]></category>
		<category><![CDATA[Tools]]></category>
		<category><![CDATA[Views]]></category>
		<category><![CDATA[Developer]]></category>
		<category><![CDATA[News]]></category>

		<guid isPermaLink="false">http://www.petapath.com/blog/?p=86</guid>
		<description><![CDATA[PGI have released version 9.0 of their Fortran and C99 compilers which includes the PGI Accelerator™  support (effectively x86+GPU). PGI are making some grand claims for these extensions but without having used the tools for myself I am doubtful that they are quite as all singing and dancing as PGI are making out. The Programming [...]]]></description>
			<content:encoded><![CDATA[<p>PGI have released version 9.0 of their Fortran and C99 compilers which includes the PGI Accelerator™ support (effectively x86+GPU). PGI are making some grand claims for these extensions but without having used the tools for myself I am doubtful that they are quite as all singing and dancing as PGI are making out. The <a title="PGI Accelerator Programming Model" href="http://www.pgroup.com/lit/whitepapers/pgi_accel_prog_model_1.0.pdf" target="_blank">Programming Model</a> whitepaper does address some of the critical aspects of porting software to work effectively on heterogeneous systems (which is less about the compute and more about data movement and maximising bandwidths once on the accelerator) but it also serves to mask an important factor that is currently a sticking point for many people exploring the use of heterogeneous systems: that applications written for x86 may not be the best starting point for achieving good performance on accelerators.</p>
<p>Update: Incidentally there is quite a bit of information on the PGI <a title="PGI Accelerator Compilers" href="http://www.pgroup.com/resources/accel.htm" target="_blank">web site</a> and a solid series of articles by <a title="HPCWire 'Compilers &amp; More'" href="http://www.pgroup.com/resources/articles.htm" target="_blank">Michael Wolfe published on HPCWire</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.petapath.com/blog/2009/07/21/pgi-launch-compilers-for-heterogeneous-computing/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

