{"id":10987,"date":"2018-02-23T08:16:17","date_gmt":"2018-02-23T07:16:17","guid":{"rendered":"https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/"},"modified":"2018-02-23T08:16:17","modified_gmt":"2018-02-23T07:16:17","slug":"oda-manually-resetting-a-cpu-alert-using-fault-manager","status":"publish","type":"post","link":"https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/","title":{"rendered":"ODA &#8211; manually resetting a CPU alert using Fault Manager"},"content":{"rendered":"<p>While supporting since several years ODAs of different generations and versions, we faced time to time some hardware alerts sent back by the ILOM. However all of them are not related to real hardware issues and are false positive. To get rid of them the solution is to reset them manually.<\/p>\n<p><!--more--><\/p>\n<p>When an hardware error occurs the first reaction is to open a Service Request and to provide an ILOM snapshot to the support. This can easily be done using the Maintenance menu in the ILOM web interface.<\/p>\n<p>Based on support feedback, they may confirm that this alert is simply a false positive. Another solution if support answer is too slow is simply to give a try\u00a0 \ud83d\ude00<br \/>\nHowever this will need a server reboot to ensure the alert really disappeared.<\/p>\n<p>Here an example of a fault alarm about CPU we faced:<\/p>\n<pre class=\"brush: shell; gutter: false; first-line: 1\">Date\/Time                 Subsystems          Component\n------------------------  ------------------  ------------\nTue Feb 13 14:00:26 2018  Power               PS1 (Power Supply 1)\n        A loss of AC input power to a power supply has been detected.\n        (Probability:100, UUID:84846f3c-036d-6941-eaca-de18c4c236bd,\n        Resource:\/SYS\/PS1, Part Number:7333459, Serial\n        Number:465824T+1734D30847, Reference\n        Document:http:\/\/support.oracle.com\/msg\/SPX86A-8003-EL)\nThu Feb 15 14:27:04 2018  System              DBP (Disk Backplane)\n        ILOM has detected that a PCIE link layer is inactive. (Probability:25,\n        UUID:49015767-38b2-6372-9526-c2d2c3885a72, Resource:\/SYS\/DBP, Part\n        Number:7341145, Serial Number:465136N+1739P2009T, Reference\n        Document:http:\/\/support.oracle.com\/msg\/SPX86A-8009-3J)\nThu Feb 15 14:27:04 2018  System              MB (Motherboard)\n        ILOM has detected that a PCIE link layer is inactive. (Probability:25,\n        UUID:49015767-38b2-6372-9526-c2d2c3885a72, Resource:\/SYS\/MB, Part\n        Number:7317636, Serial Number:465136N+1742P500BX, Reference\n        Document:http:\/\/support.oracle.com\/msg\/SPX86A-8009-3J)\nThu Feb 15 14:27:04 2018  Processors          P1 (CPU 1)\n        ILOM has detected that a PCIE link layer is inactive. (Probability:25,\n        UUID:49015767-38b2-6372-9526-c2d2c3885a72, Resource:\/SYS\/MB\/P1, Part\n        Number:SR3AX, Serial Number:54-85FED07F672D3DD3, Reference\n        Document:http:\/\/support.oracle.com\/msg\/SPX86A-8009-3J)<\/pre>\n<p>&nbsp;<\/p>\n<p>We can see that there are indeed 3 alerts for this issue.<\/p>\n<p>In order to reset such an alert, you need first to log in on the server as root and access the IPMI tool<\/p>\n<pre class=\"brush: shell; gutter: false; first-line: 1\">[root@oda-dbi01 ~]# ipmitool -I open sunoem cli\nConnected. Use ^D to exit.\n\nOracle(R) Integrated Lights Out Manager\n\nVersion 4.0.0.28 r121827\n\nCopyright (c) 2017, Oracle and\/or its affiliates. All rights reserved.\n\nWarning: password is set to factory default.\n\nWarning: HTTPS certificate is set to factory default.\n\nHostname: oda-dbi01-ilom<\/pre>\n<p>&nbsp;<\/p>\n<p>Once in IPMI, you can list the Open Problems to get the same output than above using the following command:<\/p>\n<pre class=\"brush: shell; gutter: false; first-line: 1\">-&gt; ls \/System\/Open_Problems<\/pre>\n<p>In the list of the Open Problems we can find the UUID of the concerned component (see 3rd line)<\/p>\n<pre class=\"brush: shell; gutter: false; first-line: 1\">Thu Feb 15 14:27:04 2018  Processors          P1 (CPU 1)\n        ILOM has detected that a PCIE link layer is inactive. (Probability:25,\n        UUID:49015767-38b2-6372-9526-c2d2c3885a72, Resource:\/SYS\/MB\/P1, Part\n        Number:SR3AX, Serial Number:54-85FED07F672D3DD3, Reference\n        Document:http:\/\/support.oracle.com\/msg\/SPX86A-8009-3J)<\/pre>\n<p>&nbsp;<\/p>\n<p>Now it is time to access the fault manager to reset all alerts related to this UUID<\/p>\n<pre class=\"brush: shell; gutter: false; first-line: 1\">-&gt; cd SP\/faultmgmt\/shell\/\n\/SP\/faultmgmt\/shell\n\n-&gt; start\nAre you sure you want to start \/SP\/faultmgmt\/shell (y\/n)? y<\/pre>\n<p>&nbsp;<\/p>\n<p>The reset of the alert is done with the <em>fmadm<\/em> command<\/p>\n<pre class=\"brush: shell; gutter: false; first-line: 1\">faultmgmtsp&gt; fmadm acquit 49015767-38b2-6372-9526-c2d2c3885a72<\/pre>\n<p>At this point the alerts are already removed from the Open problems. However to make sure the issue is really gone, we need to reboot the ODA and check the Open Problems afterwards.<\/p>\n<p>Note that I presented here the way to check Open Problems using the IPMI command line, but the same output is also available in the ILOM web page.<\/p>\n<p>Hope it helps!<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>While supporting since several years ODAs of different generations and versions, we faced time to time some hardware alerts sent back by the ILOM. However all of them are not related to real hardware issues and are false positive. To get rid of them the solution is to reset them manually.<\/p>\n","protected":false},"author":7,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[229,199],"tags":[813,1046,79,96,872],"type_dbi":[],"class_list":["post-10987","post","type-post","status-publish","format-standard","hentry","category-database-administration-monitoring","category-hardware-storage","tag-alerts","tag-engineered-system","tag-oda","tag-oracle","tag-server"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v27.2 (Yoast SEO v27.2) - https:\/\/yoast.com\/product\/yoast-seo-premium-wordpress\/ -->\n<title>ODA - manually resetting a CPU alert using Fault Manager - dbi Blog<\/title>\n<meta name=\"description\" content=\"Sometimes some false positive alert appear on ODA. This article presents how to reset them manually using the fault manager\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"ODA - manually resetting a CPU alert using Fault Manager\" \/>\n<meta property=\"og:description\" content=\"Sometimes some false positive alert appear on ODA. This article presents how to reset them manually using the fault manager\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/\" \/>\n<meta property=\"og:site_name\" content=\"dbi Blog\" \/>\n<meta property=\"article:published_time\" content=\"2018-02-23T07:16:17+00:00\" \/>\n<meta name=\"author\" content=\"David Hueber\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"David Hueber\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/\"},\"author\":{\"name\":\"David Hueber\",\"@id\":\"https:\/\/www.dbi-services.com\/blog\/#\/schema\/person\/8873e20a98a02305870909f4e3d0088f\"},\"headline\":\"ODA &#8211; manually resetting a CPU alert using Fault Manager\",\"datePublished\":\"2018-02-23T07:16:17+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/\"},\"wordCount\":322,\"commentCount\":0,\"keywords\":[\"Alerts\",\"Engineered system\",\"ODA\",\"Oracle\",\"server\"],\"articleSection\":[\"Database Administration &amp; Monitoring\",\"Hardware &amp; Storage\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/\",\"url\":\"https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/\",\"name\":\"ODA - manually resetting a CPU alert using Fault Manager - dbi Blog\",\"isPartOf\":{\"@id\":\"https:\/\/www.dbi-services.com\/blog\/#website\"},\"datePublished\":\"2018-02-23T07:16:17+00:00\",\"author\":{\"@id\":\"https:\/\/www.dbi-services.com\/blog\/#\/schema\/person\/8873e20a98a02305870909f4e3d0088f\"},\"description\":\"Sometimes some false positive alert appear on ODA. This article presents how to reset them manually using the fault manager\",\"breadcrumb\":{\"@id\":\"https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Accueil\",\"item\":\"https:\/\/www.dbi-services.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"ODA &#8211; manually resetting a CPU alert using Fault Manager\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.dbi-services.com\/blog\/#website\",\"url\":\"https:\/\/www.dbi-services.com\/blog\/\",\"name\":\"dbi Blog\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.dbi-services.com\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.dbi-services.com\/blog\/#\/schema\/person\/8873e20a98a02305870909f4e3d0088f\",\"name\":\"David Hueber\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/secure.gravatar.com\/avatar\/fc07284dbd5667f0bed32b0d8d64076ab885746973ea1b5c4e69c6fa7074cf59?s=96&d=mm&r=g\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/fc07284dbd5667f0bed32b0d8d64076ab885746973ea1b5c4e69c6fa7074cf59?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/fc07284dbd5667f0bed32b0d8d64076ab885746973ea1b5c4e69c6fa7074cf59?s=96&d=mm&r=g\",\"caption\":\"David Hueber\"},\"description\":\"David Hueber has ten years of experience in infrastructure operation &amp; management, engineering, and optimization. He is specialized in Oracle technologies (engineering, backup and recovery, high availability, etc.), Service Management standards and Oracle infrastructure operation processes (Service Desk, Change Management, Capacity Planning, etc.). David Hueber is ITILv3 Service Operation Lifecycle certified and Linux LPIC-1 certified. He received a university degree in Informatics and Networks at the IUT Mulhouse, France. He also studied Information Systems at the Conservatoire National des Arts et M\u00e9tiers in Mulhouse, France. His branch-related experience covers Financial Services \/ Banking, Chemicals &amp; Pharmaceuticals, Transport &amp; Logistics, Retail, Food, etc.\",\"url\":\"https:\/\/www.dbi-services.com\/blog\/author\/david-hueber\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"ODA - manually resetting a CPU alert using Fault Manager - dbi Blog","description":"Sometimes some false positive alert appear on ODA. This article presents how to reset them manually using the fault manager","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/","og_locale":"en_US","og_type":"article","og_title":"ODA - manually resetting a CPU alert using Fault Manager","og_description":"Sometimes some false positive alert appear on ODA. This article presents how to reset them manually using the fault manager","og_url":"https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/","og_site_name":"dbi Blog","article_published_time":"2018-02-23T07:16:17+00:00","author":"David Hueber","twitter_card":"summary_large_image","twitter_misc":{"Written by":"David Hueber","Est. reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/#article","isPartOf":{"@id":"https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/"},"author":{"name":"David Hueber","@id":"https:\/\/www.dbi-services.com\/blog\/#\/schema\/person\/8873e20a98a02305870909f4e3d0088f"},"headline":"ODA &#8211; manually resetting a CPU alert using Fault Manager","datePublished":"2018-02-23T07:16:17+00:00","mainEntityOfPage":{"@id":"https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/"},"wordCount":322,"commentCount":0,"keywords":["Alerts","Engineered system","ODA","Oracle","server"],"articleSection":["Database Administration &amp; Monitoring","Hardware &amp; Storage"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/","url":"https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/","name":"ODA - manually resetting a CPU alert using Fault Manager - dbi Blog","isPartOf":{"@id":"https:\/\/www.dbi-services.com\/blog\/#website"},"datePublished":"2018-02-23T07:16:17+00:00","author":{"@id":"https:\/\/www.dbi-services.com\/blog\/#\/schema\/person\/8873e20a98a02305870909f4e3d0088f"},"description":"Sometimes some false positive alert appear on ODA. This article presents how to reset them manually using the fault manager","breadcrumb":{"@id":"https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.dbi-services.com\/blog\/oda-manually-resetting-a-cpu-alert-using-fault-manager\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Accueil","item":"https:\/\/www.dbi-services.com\/blog\/"},{"@type":"ListItem","position":2,"name":"ODA &#8211; manually resetting a CPU alert using Fault Manager"}]},{"@type":"WebSite","@id":"https:\/\/www.dbi-services.com\/blog\/#website","url":"https:\/\/www.dbi-services.com\/blog\/","name":"dbi Blog","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.dbi-services.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/www.dbi-services.com\/blog\/#\/schema\/person\/8873e20a98a02305870909f4e3d0088f","name":"David Hueber","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/fc07284dbd5667f0bed32b0d8d64076ab885746973ea1b5c4e69c6fa7074cf59?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/fc07284dbd5667f0bed32b0d8d64076ab885746973ea1b5c4e69c6fa7074cf59?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/fc07284dbd5667f0bed32b0d8d64076ab885746973ea1b5c4e69c6fa7074cf59?s=96&d=mm&r=g","caption":"David Hueber"},"description":"David Hueber has ten years of experience in infrastructure operation &amp; management, engineering, and optimization. He is specialized in Oracle technologies (engineering, backup and recovery, high availability, etc.), Service Management standards and Oracle infrastructure operation processes (Service Desk, Change Management, Capacity Planning, etc.). David Hueber is ITILv3 Service Operation Lifecycle certified and Linux LPIC-1 certified. He received a university degree in Informatics and Networks at the IUT Mulhouse, France. He also studied Information Systems at the Conservatoire National des Arts et M\u00e9tiers in Mulhouse, France. His branch-related experience covers Financial Services \/ Banking, Chemicals &amp; Pharmaceuticals, Transport &amp; Logistics, Retail, Food, etc.","url":"https:\/\/www.dbi-services.com\/blog\/author\/david-hueber\/"}]}},"_links":{"self":[{"href":"https:\/\/www.dbi-services.com\/blog\/wp-json\/wp\/v2\/posts\/10987","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.dbi-services.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.dbi-services.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.dbi-services.com\/blog\/wp-json\/wp\/v2\/users\/7"}],"replies":[{"embeddable":true,"href":"https:\/\/www.dbi-services.com\/blog\/wp-json\/wp\/v2\/comments?post=10987"}],"version-history":[{"count":0,"href":"https:\/\/www.dbi-services.com\/blog\/wp-json\/wp\/v2\/posts\/10987\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.dbi-services.com\/blog\/wp-json\/wp\/v2\/media?parent=10987"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.dbi-services.com\/blog\/wp-json\/wp\/v2\/categories?post=10987"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.dbi-services.com\/blog\/wp-json\/wp\/v2\/tags?post=10987"},{"taxonomy":"type","embeddable":true,"href":"https:\/\/www.dbi-services.com\/blog\/wp-json\/wp\/v2\/type_dbi?post=10987"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}