<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: What was wrong?</title>
	<atom:link href="http://coskan.wordpress.com/2011/01/05/what-was-wrong/feed/" rel="self" type="application/rss+xml" />
	<link>http://coskan.wordpress.com/2011/01/05/what-was-wrong/</link>
	<description>What I learned about Oracle</description>
	<lastBuildDate>Wed, 22 May 2013 01:32:44 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: coskan</title>
		<link>http://coskan.wordpress.com/2011/01/05/what-was-wrong/#comment-4306</link>
		<dc:creator><![CDATA[coskan]]></dc:creator>
		<pubDate>Fri, 07 Jan 2011 13:59:17 +0000</pubDate>
		<guid isPermaLink="false">http://coskan.wordpress.com/?p=900#comment-4306</guid>
		<description><![CDATA[I have updated the post with my actions]]></description>
		<content:encoded><![CDATA[<p>I have updated the post with my actions</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: coskan</title>
		<link>http://coskan.wordpress.com/2011/01/05/what-was-wrong/#comment-4305</link>
		<dc:creator><![CDATA[coskan]]></dc:creator>
		<pubDate>Fri, 07 Jan 2011 13:56:38 +0000</pubDate>
		<guid isPermaLink="false">http://coskan.wordpress.com/?p=900#comment-4305</guid>
		<description><![CDATA[you saw the merge but not on the right sqlid :) 

96b6hrt8csw55 was doing merge

903wwaubvn68k was doing an update 

apart from that your analysis is very close to mine 

here is what I see when I checked 961 and 1057 
&lt;pre&gt;
SQL&gt; @usid 961

 SID             PROGRAM              SPID      LASTCALL STATUS
 --------------  -------------------- ------------------ --------
  &#039;961,1&#039;        (DBW0)               26636       255137 ACTIVE
                                 


SQL&gt; @usid 1057

 SID             PROGRAM              SPID        LASTCALL STATUS
 --------------  -------------------- --------  ---------- --------
  &#039;1057,1&#039;       (DBW1)               26638         255143 ACTIVE
                
&lt;/pre&gt;

then I decided to do another check

.. rest is in update]]></description>
		<content:encoded><![CDATA[<p>you saw the merge but not on the right sqlid <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  </p>
<p>96b6hrt8csw55 was doing merge</p>
<p>903wwaubvn68k was doing an update </p>
<p>apart from that your analysis is very close to mine </p>
<p>here is what I see when I checked 961 and 1057 </p>
<pre>
SQL&gt; @usid 961

 SID             PROGRAM              SPID      LASTCALL STATUS
 --------------  -------------------- ------------------ --------
  '961,1'        (DBW0)               26636       255137 ACTIVE
                                 


SQL&gt; @usid 1057

 SID             PROGRAM              SPID        LASTCALL STATUS
 --------------  -------------------- --------  ---------- --------
  '1057,1'       (DBW1)               26638         255143 ACTIVE
                
</pre>
<p>then I decided to do another check</p>
<p>.. rest is in update</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Narendra</title>
		<link>http://coskan.wordpress.com/2011/01/05/what-was-wrong/#comment-4304</link>
		<dc:creator><![CDATA[Narendra]]></dc:creator>
		<pubDate>Thu, 06 Jan 2011 07:33:18 +0000</pubDate>
		<guid isPermaLink="false">http://coskan.wordpress.com/?p=900#comment-4304</guid>
		<description><![CDATA[&lt;i&gt;All I did was check with @swact and another check and took an action. I am after my action and the other check as answer&lt;/i&gt;
I am still trying to understand/digest all the explainations so excuse me if I am shooting in the dark but did your &quot;another check&quot; involve checking what the session 961 was doing? It seems session with SID 961 is causing &quot;Free Buffer Waits&quot;.]]></description>
		<content:encoded><![CDATA[<p><i>All I did was check with @swact and another check and took an action. I am after my action and the other check as answer</i><br />
I am still trying to understand/digest all the explainations so excuse me if I am shooting in the dark but did your &#8220;another check&#8221; involve checking what the session 961 was doing? It seems session with SID 961 is causing &#8220;Free Buffer Waits&#8221;.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Taral Desai</title>
		<link>http://coskan.wordpress.com/2011/01/05/what-was-wrong/#comment-4302</link>
		<dc:creator><![CDATA[Taral Desai]]></dc:creator>
		<pubDate>Wed, 05 Jan 2011 23:42:30 +0000</pubDate>
		<guid isPermaLink="false">http://coskan.wordpress.com/?p=900#comment-4302</guid>
		<description><![CDATA[Opps. Click button earlier Also, 49wv7fcafp592 which id doing direct path io need to do object level checkpoint and this would be the object that is holding that rows]]></description>
		<content:encoded><![CDATA[<p>Opps. Click button earlier Also, 49wv7fcafp592 which id doing direct path io need to do object level checkpoint and this would be the object that is holding that rows</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Taral Desai</title>
		<link>http://coskan.wordpress.com/2011/01/05/what-was-wrong/#comment-4301</link>
		<dc:creator><![CDATA[Taral Desai]]></dc:creator>
		<pubDate>Wed, 05 Jan 2011 23:38:28 +0000</pubDate>
		<guid isPermaLink="false">http://coskan.wordpress.com/?p=900#comment-4301</guid>
		<description><![CDATA[&lt;code&gt;
684 WAITING enq: TX - row lock contention                 32955       146961 1415053318    1703962    7965635 0x54580006: TX mode6                       903wwaubvn68k    4       1256 VALID         ##########
    104 WAITING enq: TX - row lock contention                 60635       174744 1415053318    2686999    2493927 0x54580006: TX mode6                       903wwaubvn68k    2       1259 VALID         ##########
   1357 WAITING enq: TX - row lock contention                  1018        15106 1415053318    2949135    1686270 0x54580006: TX mode6                       903wwaubvn68k    5        297 VALID         ##########
   1450 WAITING enq: TX - row lock contention                    34       141454 1415053318    3014687    1597677 0x54580006: TX mode6                       903wwaubvn68k    1       1067 VALID         ##########
   1444 WAITING enq: TX - row lock contention                 12087       135239 1415053318    3473432    1523820 0x54580006: TX mode6                       903wwaubvn68k    4        779 VALID         ##########
   1454 WAITING enq: TX - row lock contention                 34927          799 1415053318    1441817   14075933 0x54580006: TX mode6    

&lt;code&gt;

Seconds in wait to high and all this are on same transaction slot with mode 6. Someone might need to rollback or commit.

May be i am wrong.]]></description>
		<content:encoded><![CDATA[<p><code><br />
684 WAITING enq: TX - row lock contention                 32955       146961 1415053318    1703962    7965635 0x54580006: TX mode6                       903wwaubvn68k    4       1256 VALID         ##########<br />
    104 WAITING enq: TX - row lock contention                 60635       174744 1415053318    2686999    2493927 0x54580006: TX mode6                       903wwaubvn68k    2       1259 VALID         ##########<br />
   1357 WAITING enq: TX - row lock contention                  1018        15106 1415053318    2949135    1686270 0x54580006: TX mode6                       903wwaubvn68k    5        297 VALID         ##########<br />
   1450 WAITING enq: TX - row lock contention                    34       141454 1415053318    3014687    1597677 0x54580006: TX mode6                       903wwaubvn68k    1       1067 VALID         ##########<br />
   1444 WAITING enq: TX - row lock contention                 12087       135239 1415053318    3473432    1523820 0x54580006: TX mode6                       903wwaubvn68k    4        779 VALID         ##########<br />
   1454 WAITING enq: TX - row lock contention                 34927          799 1415053318    1441817   14075933 0x54580006: TX mode6    </p>
<p></code><code></p>
<p>Seconds in wait to high and all this are on same transaction slot with mode 6. Someone might need to rollback or commit.</p>
<p>May be i am wrong.</code></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Radoslav Golian</title>
		<link>http://coskan.wordpress.com/2011/01/05/what-was-wrong/#comment-4300</link>
		<dc:creator><![CDATA[Radoslav Golian]]></dc:creator>
		<pubDate>Wed, 05 Jan 2011 22:30:20 +0000</pubDate>
		<guid isPermaLink="false">http://coskan.wordpress.com/?p=900#comment-4300</guid>
		<description><![CDATA[Hm, I&#039;m not very experienced troubleshooter, but I would probably check statements with that fullscans where buffer cache is used (db file scattered read), especially: 96b6hrt8csw55 (fullscan on partitioned table)
and 903wwaubvn68k 
903wwaubvn68k is very interesting it contains fullscan (scattered reads), requests an exclusive locks (so it&#039;s modifying data) and it is also doing index reads (sequential reads)
Is it an MERGE statement? Or some update statement with IN/EXISTS?

and I would probably check what sessions 961 and 1057 are doing. 

free buffer waits occurs when session cannot find free buffers to read data or *to build a consistent image of data block* - I think it&#039;s only a symptom]]></description>
		<content:encoded><![CDATA[<p>Hm, I&#8217;m not very experienced troubleshooter, but I would probably check statements with that fullscans where buffer cache is used (db file scattered read), especially: 96b6hrt8csw55 (fullscan on partitioned table)<br />
and 903wwaubvn68k<br />
903wwaubvn68k is very interesting it contains fullscan (scattered reads), requests an exclusive locks (so it&#8217;s modifying data) and it is also doing index reads (sequential reads)<br />
Is it an MERGE statement? Or some update statement with IN/EXISTS?</p>
<p>and I would probably check what sessions 961 and 1057 are doing. </p>
<p>free buffer waits occurs when session cannot find free buffers to read data or *to build a consistent image of data block* &#8211; I think it&#8217;s only a symptom</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: coskan</title>
		<link>http://coskan.wordpress.com/2011/01/05/what-was-wrong/#comment-4299</link>
		<dc:creator><![CDATA[coskan]]></dc:creator>
		<pubDate>Wed, 05 Jan 2011 18:43:46 +0000</pubDate>
		<guid isPermaLink="false">http://coskan.wordpress.com/?p=900#comment-4299</guid>
		<description><![CDATA[I can say you add different dimension to my own analysis which is good that I learn more things from my own post:) 

Script is slightly modified version of SW.sql (modification is for buffer busy reads nothing to do with this post) called for active foreground sessions with a newer name @swact 

This server was completely &quot;new&quot; to me apart from the fact that I only knew  the massive core table with 500GB size was partitioned with dbms_redefinition and &quot;partitioning&quot; operation was finished reported as success but as I said the main index was missing that was causing other sessions suffer. 

All I did was check with @swact and another check and took an action. I am after my action and the other check as answer. 

Thanks for stopping by by the way]]></description>
		<content:encoded><![CDATA[<p>I can say you add different dimension to my own analysis which is good that I learn more things from my own post:) </p>
<p>Script is slightly modified version of SW.sql (modification is for buffer busy reads nothing to do with this post) called for active foreground sessions with a newer name @swact </p>
<p>This server was completely &#8220;new&#8221; to me apart from the fact that I only knew  the massive core table with 500GB size was partitioned with dbms_redefinition and &#8220;partitioning&#8221; operation was finished reported as success but as I said the main index was missing that was causing other sessions suffer. </p>
<p>All I did was check with @swact and another check and took an action. I am after my action and the other check as answer. </p>
<p>Thanks for stopping by by the way</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Charles Hooper</title>
		<link>http://coskan.wordpress.com/2011/01/05/what-was-wrong/#comment-4298</link>
		<dc:creator><![CDATA[Charles Hooper]]></dc:creator>
		<pubDate>Wed, 05 Jan 2011 17:27:56 +0000</pubDate>
		<guid isPermaLink="false">http://coskan.wordpress.com/?p=900#comment-4298</guid>
		<description><![CDATA[There really is not enough information to make an accurate guess of what is happening in this case.  Is swact.sql Tanel&#039;s session watcher script?

If I had to make a guess, I would look at the SEC_IN_STATE column and see that there are several really big numbers there, the largest at 174,744.  The script seems to be showing that only one session is on the CPU (probably the session executing swact.sql), while all other sessions are listed as waiting.

[sourcecode]
SQL&gt; SELECT 174744/60/60/24 FROM DUAL;
 
174744/60/60/24
---------------
         2.0225
 
SQL&gt; SELECT TO_CHAR(TO_DATE(&#039;27-DEC-2010 11:30&#039;,&#039;DD-MON-YYYY HH24:MI&#039;)-2.0225,&#039;MM/DD HH24:MI&#039;) FROM DUAL;
 
TO_CHAR(TO_
-----------
12/25 10:57
[/sourcecode]

So, that puts the first row lock contention wait at roughly 11 AM on December 25.  Something, probably a scheduled job, likely started at 11 AM on Christmas day and attempted to insert a row using the SQL statement with SQL_ID 903wwaubvn68k, causing a possible primary key violation if session 1259 committed.  Three other sessions also started being blocked when attempting to execute the same SQL statement (all had different CHILD_NUMBER values, one additional session was attempting to execute SQL ID drjnqxby1927s).  This might suggest that the primary key is not generated by a sequence, but instead by something like:
[sourcecode]
SELECT MAX(ID)+1 NEW_ID FROM T1;
[/sourcecode]
The likely cause of that wait is probably an end-user failing to click OK to indicate that a set of documents printed (or something similar) so that a COMMIT could be performed, before leaving for the Christmas holiday.

The ‘enq: KO - fast object checkpoint’ waits typically indicate that a direct path read is about to happen, possibly due to a parallel query execution or a serial direct read of a table.  The first wait of that type shown started at about 9:12 AM on December 27, and the last one at 11:20 AM.  At roughly 11:20 AM you yelled at the SAN causing it to panic (blogs.sun.com/brendan/entry/unusual_disk_latency) which then triggered additional waits for blocks to be read from the SAN.  The decrease in the ‘free buffer waits’ is simply an indication of less pressure on the buffer cache because all of the other sessions are still waiting for their previous IO requests to complete.

How close is my guess?]]></description>
		<content:encoded><![CDATA[<p>There really is not enough information to make an accurate guess of what is happening in this case.  Is swact.sql Tanel&#8217;s session watcher script?</p>
<p>If I had to make a guess, I would look at the SEC_IN_STATE column and see that there are several really big numbers there, the largest at 174,744.  The script seems to be showing that only one session is on the CPU (probably the session executing swact.sql), while all other sessions are listed as waiting.</p>
<pre class="brush: plain; title: ; notranslate">
SQL&gt; SELECT 174744/60/60/24 FROM DUAL;
 
174744/60/60/24
---------------
         2.0225
 
SQL&gt; SELECT TO_CHAR(TO_DATE('27-DEC-2010 11:30','DD-MON-YYYY HH24:MI')-2.0225,'MM/DD HH24:MI') FROM DUAL;
 
TO_CHAR(TO_
-----------
12/25 10:57
</pre>
<p>So, that puts the first row lock contention wait at roughly 11 AM on December 25.  Something, probably a scheduled job, likely started at 11 AM on Christmas day and attempted to insert a row using the SQL statement with SQL_ID 903wwaubvn68k, causing a possible primary key violation if session 1259 committed.  Three other sessions also started being blocked when attempting to execute the same SQL statement (all had different CHILD_NUMBER values, one additional session was attempting to execute SQL ID drjnqxby1927s).  This might suggest that the primary key is not generated by a sequence, but instead by something like:</p>
<pre class="brush: plain; title: ; notranslate">
SELECT MAX(ID)+1 NEW_ID FROM T1;
</pre>
<p>The likely cause of that wait is probably an end-user failing to click OK to indicate that a set of documents printed (or something similar) so that a COMMIT could be performed, before leaving for the Christmas holiday.</p>
<p>The ‘enq: KO &#8211; fast object checkpoint’ waits typically indicate that a direct path read is about to happen, possibly due to a parallel query execution or a serial direct read of a table.  The first wait of that type shown started at about 9:12 AM on December 27, and the last one at 11:20 AM.  At roughly 11:20 AM you yelled at the SAN causing it to panic (blogs.sun.com/brendan/entry/unusual_disk_latency) which then triggered additional waits for blocks to be read from the SAN.  The decrease in the ‘free buffer waits’ is simply an indication of less pressure on the buffer cache because all of the other sessions are still waiting for their previous IO requests to complete.</p>
<p>How close is my guess?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Anand</title>
		<link>http://coskan.wordpress.com/2011/01/05/what-was-wrong/#comment-4297</link>
		<dc:creator><![CDATA[Anand]]></dc:creator>
		<pubDate>Wed, 05 Jan 2011 15:04:54 +0000</pubDate>
		<guid isPermaLink="false">http://coskan.wordpress.com/?p=900#comment-4297</guid>
		<description><![CDATA[Hi Coskan,

The first thing i would check is  &quot;903wwaubvn68k&quot; sql_id.This particular sql_id is showing &quot;db file scattered read&quot;,&quot;enq: TX - row lock contention&quot; and &quot;free buffer waits&quot;.Also, would have checked session id 961.

&quot;What might be wrong for free buffer event&quot;-- free buffer events can be caused due to inefficient sqls.

Session 1442 shows &quot;write complete waits&quot; event which could be cause of I/O staturation.So, I/O is also to be considered for check.


Regards,
Anand]]></description>
		<content:encoded><![CDATA[<p>Hi Coskan,</p>
<p>The first thing i would check is  &#8220;903wwaubvn68k&#8221; sql_id.This particular sql_id is showing &#8220;db file scattered read&#8221;,&#8221;enq: TX &#8211; row lock contention&#8221; and &#8220;free buffer waits&#8221;.Also, would have checked session id 961.</p>
<p>&#8220;What might be wrong for free buffer event&#8221;&#8211; free buffer events can be caused due to inefficient sqls.</p>
<p>Session 1442 shows &#8220;write complete waits&#8221; event which could be cause of I/O staturation.So, I/O is also to be considered for check.</p>
<p>Regards,<br />
Anand</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ogan Ozdogan</title>
		<link>http://coskan.wordpress.com/2011/01/05/what-was-wrong/#comment-4296</link>
		<dc:creator><![CDATA[Ogan Ozdogan]]></dc:creator>
		<pubDate>Wed, 05 Jan 2011 13:18:30 +0000</pubDate>
		<guid isPermaLink="false">http://coskan.wordpress.com/?p=900#comment-4296</guid>
		<description><![CDATA[Hi Coskan,

First, I would have checked the contentions and wait events so we have a contention and a wait event in this very example. Row lock contention could happened because of the application itself. Free buffer waits are there because DBWn can not cope with the dirty buffers fast enough. Jonathan Lewis posted a nice article here --&gt; http://jonathanlewis.wordpress.com/2006/11/21/free-buffer-waits/
So either the objects were pinned in the buffer or DBWn was not fast enough. However, i also see in the chart that we had a vicious log file switch (checkpoint incomplete) wait event for a while near Dec 28, 09:00 AM. I believe there is something wrong with the log_buffer and the online redolog sizes. If they can be tuned than the problem may dissapper before digging into a real analysis.

Ogan]]></description>
		<content:encoded><![CDATA[<p>Hi Coskan,</p>
<p>First, I would have checked the contentions and wait events so we have a contention and a wait event in this very example. Row lock contention could happened because of the application itself. Free buffer waits are there because DBWn can not cope with the dirty buffers fast enough. Jonathan Lewis posted a nice article here &#8211;&gt; <a href="http://jonathanlewis.wordpress.com/2006/11/21/free-buffer-waits/" rel="nofollow">http://jonathanlewis.wordpress.com/2006/11/21/free-buffer-waits/</a><br />
So either the objects were pinned in the buffer or DBWn was not fast enough. However, i also see in the chart that we had a vicious log file switch (checkpoint incomplete) wait event for a while near Dec 28, 09:00 AM. I believe there is something wrong with the log_buffer and the online redolog sizes. If they can be tuned than the problem may dissapper before digging into a real analysis.</p>
<p>Ogan</p>
]]></content:encoded>
	</item>
</channel>
</rss>
