Archive for category general
Off to Oracle Openworld
Posted by Jay Caviness in Oracle Openworld, general on October 9th, 2009
I am leaving for Openworld tomorrow and presenting on Monday. I will be at most of the IOUG RAC-SIG events. Feel free to say hello, I look forward to seeing everyone again this year.
11gR2 is Available!
Posted by Jay Caviness in general on September 1st, 2009
Finally! I have been holding my breath on this announcement for almost 18 months since I was invited to be on the RAC/ASM Beta test team for 11gR2. It has been a long road and have had to keep my mouth shut myriad times at Openworld last year and IOUG Collaborate back in May. I have not been this excited about a release since Oracle 10g. This is what a grid is supposed to be. The changes to RAC and ASM alone will take me several posts, but I will get out as much information as I can as quickly as I can.
Let’s just start this way, for RAC, 11gR2 is a quantum shift. 11gR1 is essentially the same clusterware as 10gR2, fixes but no major changes. R2 redefines the game. While you can use the old style RAC clusters, R2 offers the way out of defining nodes by name and number and allows on the fly provisioning across clusters. Other new features include:
1. moving ASM into the cluster (one less Oracle_Home to worrry about)
2. storing OCR and voting disks within ASM
3. the use of a SCAN to remove the need of clients knowing VIPs
4. the use of GNS (Grid Naming Service) to take advantage of Grid Plug’n'Play in expanding a cluster on the fly
5. ASM filesystem (ACFS)
I will delve more into these and other subjects in the near future. I have been working on 11gR2 for quite a while now and while there are lofty expectations, 11gR2 will be there to meet them all.
A word of warning, however, don’t click the video tab on the link above, or you will have to stare at my mug! I couldn’t see a thing without my glasses while being filmed!
TOD – Top 5 reasons the cluster will not start on a node.
Posted by Jay Caviness in general on July 28th, 2009
I have mentioned before the top two, but here are the top 5 most common reasons that the CRS, CSS, EVM, etc, processes will not start on a node:
1. Network issue – NIC card might be unavailable, VIP assigned to a node on a different system, switch may be bad, etc.
2. Storage issue – verify that all the voting and OCR disks/luns are accessible on all nodes and are owned appropriately (OCR – root:dba & 640, Voting Disks – oracle:dba * 640). You can test by dumping with dd and piping to strings – dd if=/dev/raw/raw2 | strings .
3. Too many sockets – look in /tmp/.oracle and /var/tmp/.oracle and remove any and all socket files. These can build up between reboots after many failed attempts to start the cluster.
4. Filesystem full – if the filesystem containing CRS_HOME is full, or becomes full during the start of a cluster, it will just hang for the 600 second timeout and die. If it can’t write a log, it won’t start up.
5. /etc/oracle missing or corrupt. The /etc/oracle directory is created when root.sh is run for CRS. it contains the OCR location and other files needed by the cluster. If CRS cannot find the location of the OCR files, it will look in default locations and then simply hang while searching for the default 600 seconds.
As always, search out the logs in $CRS_HOME/log/
Keep on RAC’n!
New format for Grumpy-DBA.com
Posted by Jay Caviness in general on July 14th, 2009
I am experimenting with a new format on my blog site. I had received some comments that with the three column format, long text boxes were being cut off. I started using this theme with only one side column to allow for a wider post format.
Attributes of a Great DBA #1 – Humility
Posted by Jay Caviness in general on May 22nd, 2009
Humility and a decided lack of ego. It is one thing to be confident with your skills, however, most of the best Oracle people I know are also the most humble. Case in point is Mike Ault. I have known of his work, books and appreciation for all things Oracle for many years. I met him in person about two years ago at RMOUG training days in Denver and again at IOUG Collagorate 09 in Orlando. Instead of regaling me with Oracle knowledge, we talked of diving, kids and tiramisu. Why is this so important? Because while you can learn sitting at the feet of a guru, you can also learn something new from even the lowest, newest and greenest of people if you are open to it. I have also known engineers who believe the world should revolve around them, and you know what? They are usually constantly involved in a perceived crisis that someone, anyone, else caused.
Humility will gain you the world. There are a few Oracle bigwigs out there that set up shop at a conference and look more like PT Barnum than a credible source all in the name of hawking their books, services or advice. (Those that know me know of whom I speak!) Getting a perspective from others with different talents than you and lesser as well as greater talents than you. Listen to your colleagues and don’t rush to judgment on ideas just because they are from a newbie, you never know where that spark will come from that will solve a problem now in the future.
Attributes of a Great DBA #2 – Integrity
Posted by Jay Caviness in general on May 15th, 2009
Integrity – be honest in all you do, it is easier than trying to remember what you lied about! There is not a whole lot is new under the sun, ideas/scripts/processes are products of hashing and rehashing old ideas with new to create something that fits your needs. My favorite ASM scripts are based on Jeff Hunters scripts, he certainly writes better formatted SQL than I do and the scripts I based my ASM scripts on are very useful for me. Often plagiarism goes under the guise of “code reuse”, that is fine, but give credit where credit is due.
I have solved many problems for customers over the years, but try to never leave them without an understanding of what went on, how it was resolved and how it was fixed if at all possible and time permits. This is vital to to your client relationship and your own sense of self worth. There are times where a root cause analysis does not bare anything out, and you have to explain to a client or manager that the cause may be found, but it may be cost and time prohibitive. Be honest in all things you do, covering something up almost always involves digging your own grave. Having been on more than one forensic analysis teams, I have seen first hand what happens when someone either maliciously damages a system or damages it accidentally, and then tries to cover it up. It rarely works and the damage to your reputation can be permanent.
Attributes of a Great DBA #3 – Imagination
Posted by Jay Caviness in general on May 13th, 2009
Imagination – Above intelligence? You bet. The ability to think outside the box is critical, much of that comes from experience. I am not a huge Star Trek fan, but I remember a scene from The Wrath of Khan when the crew was trying to force their enemy’s shields down and Kirk said “You have to know why things work.” That is an excellent point, so many things can affect an Oracle instance, database, cluster, etc. It is often like the large mixing boards that music producers use in a studio. If you move one slide up 12 others may move down. It partially comes back to #4, that if you amass knowledge in many different areas, they will mix in such a way that your imagination can find soluti0ns to which there is no (apparent) logic.
Attributes of a Great DBA – #4 Intelligence
Posted by Jay Caviness in general on May 6th, 2009
Intelligence – This had to be in the list right? Not number one, however. There is a difference in “book learnin’” and intelligence. Intelligence is more the process of solving a problem to the point of resolution. It can be coupled directly with wisdom. A DBA must have the ability to go from point A to B to C to solve a problem. With experience, that process may go from A to D to Z because of an intuition born of experience even if you have not seen a similar problem before.
Where do you get this intelligence? As noted above, time, in the guise of experience, is a large part. Absorbing information my multiple sources is the majority of the rest. Oracle is the type of software that you learn by doing, not reading. Don’t just troll OTN forums and blogs, participate! Get a dialog going, you would be amazed at what you can learn from people in similar and dis-similar circumstances. If you are a RAC person join the RAC SIG (www.oracleracsig.org). Most importantly, however, is learn about more than just your area of expertise, by getting outside your comfort zone. While I am a specialist at RAC, I try to be a generalist in as many IT engineering areas as I can. For example, don’t just subscribe to Oracle Magazine or Select Journal, get Network World or Storage. You may not understand all the topics and concepts, but with time you will absorb them and when the time comes to make an architecture, support, design or down-time decision, you might find that some important data from outside your comfort zone has helped to make a better informed decision.
Monday at IOUG Collaborate 2009
Posted by Jay Caviness in general on May 4th, 2009
I am on site at IOUG Collaborate ‘09 this week in sunny Orlando, Florida. I will be speaking as part of a customer panel on RAC on Virtual Machines at 4:30 Wednesday. Had a great converstaion on Oracle RAC and streams with Arup Nanda and talked with Mike Ault already, and I am looking forward to the conferences and chatting with many of my friends I have made over the years.
Top 5 attributes of a great DBA
Posted by Jay Caviness in general on April 23rd, 2009
In the next few days I am posting my top 5 list of what it takes to be a great DBA. While many may not agree, it is not all about knowledge and insight. Am I a great DBA? Time and my clients will tell, but these attributes will help to ensure a long and enjoyable career in the Oracle world.
We will start with #5 -
5. Sense of humor & grace under pressure – Jean Kerr may have been right when she said, “If you can keep your head about you when all about you are losing theirs, its just possible you haven’t grasped the situation”. Most of us have been in one type of crisis or another over the years involving Oracle, the systems it lies upon and users, managers and clients beating at the door with torches and pitchforks. Many of us have wondered if we would survive the onslaught. When you are at the end of your rope, tie a knot, make a joke and hang on, there is little left to lose. I have been in this type of situations many times. Most of the time I have been called into the middle of a disaster (or what is believed to be one) with the simple command of “fix it, it be broke!”, and have a cube or conference call of people staring over your shoulder waiting on every keystroke. As bad as my typing is, that is never a good thing to watch.
The first truly memorable situation I had like this was back in the 1997-1998 timeframe, the web was just taking off and Amazon.com was the darling of the internet boom. I was working for Oracle Support at the time as a technical specialist which is supposed to mean that I know more than most about how the database kernel worked when I got called in because Amazon was down. The call between myself and their DBA became the two of us and about 30 other people all on the conference call. I was not allowed to hang up, transfer or call the client back. Everyone, including a seniror Oracle VP and two VPs from Amazon were on the phone and expecting me to articulate every move I made while manually rebuilding several datafile headers with a hex editor that had become corrupted due to a bug. Someone noted that Amazon had just made the national news because they were down. It was not pretty. But after being on the phone for 422 minutes (our phones had counters on them) every one signed off and the problem was fixed. The point to this anecdote is that I had to, with politeness and humore, be able to tell everyone on the line two things, one that I was not going to repeat every keystroke I made to the audience and two, to please shut up and let me do my work. That if they had to make business or political decisions do it on another call. I was tired, grouchy and more than once had to hit the mute button, but kept my cool with the customer.
Now, I have been in the situation where I screwed up and all the only non-explicative thing I could say was to quote that great American philosopher Urkel – “Did I do that?”. Oh boy, believe me, if you have been in the business long enough, you will break something, and break it bad. I have overwritten datafiles, dropped the wrong table, killed the wrong node, just to mention a few.
For those that know me, I fully admit, that when the mess is over (and sometimes when I push “mute” on the phone before it is over) I can get grouchy, grumpy and generally be a joy to be around.
However, a sense of humor does not always work, and you have to know when to pick your battles, as it were. There is one hospital client I was working with when I first started working with McKesson. I was on a conference call with them and let out a couple of my humorous observations at which point I would swear I heard crickets in the background. I realized very quickly that they had no sense of humor and dropped it at that point. The best part of it was that I have not been on a conference call with that client since!
No one wants a comedian during a crisis. But establishing a good raport with a client or group that is experiencing a problem or just in general is often easier with a little good humor and a lot of empathy and grace.