<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Running Systems&#187; disk access</title>
	<atom:link href="http://run.tournament.org.il/tag/disk-access/feed/" rel="self" type="application/rss+xml" />
	<link>http://run.tournament.org.il</link>
	<description>(and me chasing them)</description>
	<lastBuildDate>Wed, 02 Jun 2010 13:50:14 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>dm-multipath and loss of all paths</title>
		<link>http://run.tournament.org.il/dm-multipath-and-loss-of-all-paths/</link>
		<comments>http://run.tournament.org.il/dm-multipath-and-loss-of-all-paths/#comments</comments>
		<pubDate>Tue, 13 May 2008 03:00:22 +0000</pubDate>
		<dc:creator>ez-aton</dc:creator>
				<category><![CDATA[Disk Storage]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[disk access]]></category>
		<category><![CDATA[disk failures]]></category>
		<category><![CDATA[dm-]]></category>
		<category><![CDATA[filesystem]]></category>
		<category><![CDATA[HP]]></category>
		<category><![CDATA[mpath]]></category>
		<category><![CDATA[multipath]]></category>
		<category><![CDATA[netapp]]></category>
		<category><![CDATA[SCSI]]></category>

		<guid isPermaLink="false">http://www.tournament.org.il/run-new/?p=188</guid>
		<description><![CDATA[dm-multipath is a great tool. Its abilities were proven to me on many occasions, and I&#8217;m sure I&#8217;m not the only one. NetApp, for example, use it. HP use it as well (a slightly modified version, and still), and it works. A problem I have encountered is as follow &#8211; if a single path fails, [...]


Related posts:<ol><li><a href='http://run.tournament.org.il/hp-ux-allowed-shells-and-connecting-fc-multipath-to-netapp/' rel='bookmark' title='Permanent Link: HP-UX &#8211; allowed shells, and connecting FC Multipath to NetApp'>HP-UX &#8211; allowed shells, and connecting FC Multipath to NetApp</a> <small>When adding a certain shell to an HP-UX system, for...</small></li>
</ol>

Related posts brought to you by <a href='http://mitcho.com/code/yarpp/'>Yet Another Related Posts Plugin</a>.]]></description>
			<content:encoded><![CDATA[<p>dm-multipath is a great tool. Its abilities were proven to me on many occasions, and I&#8217;m sure I&#8217;m not the only one. <a href="http://now.netapp.com" target="_blank">NetApp</a>, for example, use it. <a href="http://www.hp.com/go/devicemapper" target="_blank">HP</a> use it as well (a slightly modified version, and still), and it works.</p>
<p>A problem I have encountered is as follow &#8211; if a single path fails, the device-mapper continue to work correctly (as expected) and the remaining path becomes active. However &#8211; if the last link fails, all processes which require disk access become stale. It means that many tests which search for a given process pass correctly even when this process becomes stale through delayed (forever) access to the filesystem. Also &#8211; tests which attempt to write/read a file to/from such a stale filesystem, become stale themselves, which can bring down an entire system (assume we have a cron which creates a file every minute. Every new process becomes stale immediately, so after an hour, we&#8217;ll have 60 more processes, and after a day &#8211; 1440 additional processes &#8211; all stale (D) and waiting for the disk to come back).</p>
<p>Certain detection systems actually fail to auto-detect cases of stale filesystems when using dm-multipath. This is caused by a (default) option called &#8220;1 queue_if_no_path&#8221;. I discovered that when this option is omitted, such as in the  configuration below (only the &#8220;device&#8221; section):</p>
<blockquote><p>device<br />
{<br />
vendor               &#8220;NETAPP&#8221;<br />
product              &#8220;LUN&#8221;<br />
getuid_callout       &#8220;/sbin/scsi_id -g -u -s /block/%n&#8221;<br />
prio_callout         &#8220;/sbin/mpath_prio_ontap /dev/%n&#8221;<br />
#     features             &#8220;1 queue_if_no_path&#8221;<br />
hardware_handler     &#8220;0&#8243;<br />
path_grouping_policy group_by_prio<br />
failback             immediate<br />
rr_weight            uniform<br />
rr_min_io            128<br />
path_checker         readsector0<br />
}</p></blockquote>
<p>multiple disk failures will actually result in the filesystem layout reporting I/O errors (which is good). A disk mounted through these options can be mounted with special parameters, such as (for ext2/3): errors=continue ; errors=read-only or errors=panic &#8211; my favorite, as it ensured data integrity through self-fencing mechanism.</p>


<p>Related posts:<ol><li><a href='http://run.tournament.org.il/hp-ux-allowed-shells-and-connecting-fc-multipath-to-netapp/' rel='bookmark' title='Permanent Link: HP-UX &#8211; allowed shells, and connecting FC Multipath to NetApp'>HP-UX &#8211; allowed shells, and connecting FC Multipath to NetApp</a> <small>When adding a certain shell to an HP-UX system, for...</small></li>
</ol></p>
<p>Related posts brought to you by <a href='http://mitcho.com/code/yarpp/'>Yet Another Related Posts Plugin</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://run.tournament.org.il/dm-multipath-and-loss-of-all-paths/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
