Oracle RAC troubleshooting

Oracle RAC's shared-disk approach to clustering databases increases scalability and lowers costs. However, finding and fixing problems in an Oracle 10g cluster can be challenging. This chapter excerpt from "Oracle Database 10g: Real Application Clusters Handbook" provides the diagnostic frameworks and tools needed to tackle even complicated database problems.

Reseller takeaway: Chapter 14 of the book Oracle Database 10g: Real Application Clusters Handbook provides tips and tools for troubleshooting RAC issues for your clients. After you've read this chapter, you'll understand how to work with the log directory structure, database performance issues, cluster reconfiguration and much more.

RAC troubleshooting

In this chapter, we look into the details of debugging Oracle Real Application Clusters, from simple startup problems to complex system hangs and crashes. As a single piece of software, Oracle RDBMS is one of the most complex commercial products in the world. But with the help of a solid and extensive diagnostics framework, you can usually diagnose even complex problems simply by viewing and interpreting Oracle's detailed trace files.

Each instance in a cluster has its own alert log, which is the first thing to examine whenever a problem is reported. Alert logs show detailed information about the basic settings of the database, including the non-default parameters used. They also record startup and shutdown, along with timestamped details of nodes joining and leaving the cluster. The alert log is specific to each instance, and its location is specified by the initialization parameter background_dump_dest. Trace files of the background processes, such as LMON or LMD, are written to the location specified by the same parameter. If shared server is configured, the trace files of the shared servers are also written to this directory.
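As a rough illustration of that first check, the sketch below scans one instance's alert log for ORA- errors and lists the trace files of a given background process. It is not from the book. The function names are hypothetical, and it assumes the usual 10g file naming on Unix-like systems: an alert log called alert_<SID>.log and trace files named <sid>_<process>_<pid>.trc, both in the directory given by background_dump_dest. Check the names against your own installation before relying on them.

```python
import re
from pathlib import Path

# Matches Oracle error codes such as ORA-00600 or ORA-29740.
ORA_ERROR = re.compile(r"\bORA-\d+")


def alert_log_path(dump_dest, sid):
    """Build the alert log path, assuming the alert_<SID>.log naming."""
    return Path(dump_dest) / f"alert_{sid}.log"


def find_ora_errors(dump_dest, sid):
    """Return (line_number, text) pairs for alert log lines with an ORA- error."""
    hits = []
    with open(alert_log_path(dump_dest, sid), errors="replace") as log:
        for number, line in enumerate(log, start=1):
            if ORA_ERROR.search(line):
                hits.append((number, line.rstrip("\n")))
    return hits


def background_traces(dump_dest, sid, process):
    """List trace files for one background process (for example LMON or LMD).

    Assumes the <sid>_<process>_<pid>.trc naming; the comparison ignores
    case because the SID's case in file names can differ between files.
    """
    prefix = f"{sid}_{process}".lower()
    return sorted(
        path
        for path in Path(dump_dest).glob("*.trc")
        if path.name.lower().startswith(prefix)
    )
```

Calling find_ora_errors with the value of background_dump_dest and the instance SID, once per instance, gives a quick per-node list of errors to compare before opening the individual LMON or LMD traces.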

Use the following table of contents to navigate to chapter excerpts.



Oracle Database 10g: Real Application Clusters Handbook

  Home: Oracle RAC troubleshooting -- chapter introduction
  1: Oracle RAC: Log directory structure in cluster ready services
  2: Oracle RAC: Log directory structure in Oracle RDBMS
  3: Oracle RAC and the Lamport algorithm
  4: Oracle RAC: ON and OFF
  5: Oracle RAC: Database performance issues
  6: Oracle RAC: Debugging node eviction issues
  7: Oracle RAC: Member voting
  8: Oracle RAC: Cluster reconfiguration steps
  9: Oracle RAC: Debugging CRS and GSD using DTRACING

About the book:
Oracle Database 10g: Real Application Clusters Handbook. Learn to implement Oracle Real Application Clusters from the ground up. Maximize database availability, scalability, and efficiency. Find RAC concepts, administration, tuning, and troubleshooting information. You'll learn how to prepare and create Oracle RAC databases and servers, and automate administrative tasks. You'll also get full coverage of cutting-edge Oracle RAC diagnostic tools, backup and recovery procedures, performance tweaks and custom application design strategies. Buy this book at McGraw-Hill/Osborne.
About the author:
K Gopalakrishnan is a senior principal consultant with the Advanced Technology Services group at Oracle Corporation, specializing exclusively in performance tuning, high availability, and disaster recovery. He is a recognized expert in Oracle RAC and database internals and has used his extensive expertise to solve many vexing performance issues around the world for telecom giants, banks, financial institutions, and universities.

