Checkpoint Tuning and Troubleshooting Guide

This document provides guidance on checkpoint tuning and troubleshooting. It discusses what checkpoints are, how they impact performance, and parameters like FAST_START_MTTR_TARGET, LOG_CHECKPOINT_INTERVAL, LOG_CHECKPOINT_TIMEOUT, and LOG_CHECKPOINTS_TO_ALERT that can be tuned. It also explains how to interpret and address checkpoint errors reported in the alert log, focusing on optimizing checkpoints for performance while still enabling fast recovery.


Checkpoint Tuning and Troubleshooting Guide [ID 147468.

--------------------------------------------------------------------------------

Modified 28-OCT-2010 Type BULLETIN Status PUBLISHED

Purpose:

This bulletin gives the Database Administrator a better understanding of incremental checkpointing and describes four initialization parameters used for checkpoint tuning:

- FAST_START_MTTR_TARGET

- LOG_CHECKPOINT_INTERVAL

- LOG_CHECKPOINT_TIMEOUT

- LOG_CHECKPOINTS_TO_ALERT

It also explains how to interpret and handle checkpoint errors: 'Checkpoint not Complete' and 'Cannot
Allocate New Log' reported in the ALERT<sid>.LOG file.

Contents:

1. What is a Checkpoint?
2. Checkpoints and Performance

3. Parameters related to incremental checkpointing

4. Redo logs and Checkpoint

5. Understanding Checkpoint Error messages ("Cannot allocate new log" and "Checkpoint not
complete")

6. Oracle Release Information

7. Using Statspack to determine Checkpointing Problems

CHECKPOINT TUNING AND ERROR HANDLING

1. What is a Checkpoint?

A checkpoint is a database event that synchronizes the modified data blocks in memory with the datafiles on disk. It gives Oracle a means of ensuring the consistency of data modified by transactions, because Oracle does not write modified blocks to disk at the moment the corresponding transactions commit.

A checkpoint has two purposes: (1) to establish data consistency, and (2) to enable faster database recovery. Recovery is faster because all database changes up to the checkpoint have been recorded in the datafiles, making it unnecessary to apply redo log entries written before the checkpoint. The checkpoint must ensure that all modified buffers in the cache are actually written to the corresponding datafiles, to avoid the loss of data that may occur with a crash (instance or disk failure).

Oracle writes the dirty buffers to disk only under certain conditions:

- When a shadow process has scanned more than one-quarter of the buffers set by the DB_BLOCK_BUFFERS parameter.

- Every three seconds.

- When a checkpoint occurs.

A checkpoint is realized on five types of events:

- At each switch of the redo log files.

- When the delay for LOG_CHECKPOINT_TIMEOUT is reached.

- When the number of bytes equal to (LOG_CHECKPOINT_INTERVAL * size of OS I/O blocks) has been written to the current redo log file.

- Directly by the ALTER SYSTEM SWITCH LOGFILE command.

- Directly with the ALTER SYSTEM CHECKPOINT command.

During a checkpoint the following occurs:

- The database writer (DBWR) writes all modified database blocks in the buffer cache back to the datafiles.

- The checkpoint process (CKPT) updates the headers of all the datafiles to indicate when the last checkpoint occurred (SCN).
2. Checkpoints and Performance

Checkpoints present a tuning dilemma for the Database Administrator. Frequent

checkpoints will enable faster recovery, but can cause performance

degradation. How then should the DBA address this?

Depending on the number of datafiles in a database, a checkpoint can be a

highly resource intensive operation, since all datafile headers are frozen

during the checkpoint. There is a performance trade-off regarding frequency

of checkpoints. More frequent checkpoints enable faster database recovery

after a crash. This is why some customer sites which have a very low

tolerance for unscheduled system downtime will often choose this option.

However, the performance degradation of frequent checkpoints may not justify

this philosophy in many cases. Let's assume the database is up and running 95%

of the time, and unavailable 5% of the time from infrequent instance crashes

or hardware failures requiring database recovery. For most customer sites, it

makes more sense to tune for the 95% case rather than the rare 5% downtime.

This bulletin assumes that performance is your number one priority and so

recommendations are made accordingly. Therefore, your goal is to minimize the frequency

of checkpoints through tuning.

Tuning checkpoints involves four key initialization parameters:

- FAST_START_MTTR_TARGET

- LOG_CHECKPOINT_INTERVAL
- LOG_CHECKPOINT_TIMEOUT

- LOG_CHECKPOINTS_TO_ALERT

These parameters are discussed in detail below.

Recommendations are also given for handling "checkpoint not complete" messages

found in the alert log, which indicate a need to tune redo logs and

checkpoints.

3. Parameters related to incremental checkpointing

Note: Log file switches will always override checkpoints caused by the following parameters.

FAST_START_MTTR_TARGET

Since Oracle 9i, the FAST_START_MTTR_TARGET parameter has been the preferred method of tuning the incremental checkpoint target. FAST_START_MTTR_TARGET lets you specify the number of seconds the database should take to perform crash recovery of a single instance. Based on internal statistics, incremental checkpointing automatically adjusts the checkpoint target to meet the FAST_START_MTTR_TARGET requirement.

V$INSTANCE_RECOVERY.ESTIMATED_MTTR shows the current estimated mean time to

recover (MTTR) in seconds. This value is shown even if FAST_START_MTTR_TARGET

is not specified.

V$INSTANCE_RECOVERY.TARGET_MTTR shows the effective MTTR target in seconds

enforced by the system.

V$MTTR_TARGET_ADVICE shows the number of I/Os generated by the current workload under the current MTTR setting, and the estimated number of I/Os that the same workload would generate under other MTTR settings. This view helps the user assess the trade-off between runtime performance and setting FAST_START_MTTR_TARGET to achieve a better recovery time.

LOG_CHECKPOINT_INTERVAL

LOG_CHECKPOINT_INTERVAL parameter specifies the maximum number of redo blocks

the incremental checkpoint target should lag the current log tail.

If FAST_START_MTTR_TARGET is specified, LOG_CHECKPOINT_INTERVAL should either not be set or be set to 0.

On most Unix systems the operating system block size is 512 bytes. Setting LOG_CHECKPOINT_INTERVAL to 10,000 therefore means the incremental checkpoint target should not lag the current log tail by more than 5,120,000 bytes (about 5M). If the size of your redo log is 20M, you are taking 4 checkpoints for each log.
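
The arithmetic above can be checked with a short Python sketch. The function names are illustrative only and are not part of any Oracle tool; the 512-byte block size and the 20M redo log are the values assumed in the text.

```python
def checkpoint_lag_bytes(log_checkpoint_interval: int, os_block_size: int = 512) -> int:
    """Maximum lag, in bytes, implied by a LOG_CHECKPOINT_INTERVAL setting."""
    return log_checkpoint_interval * os_block_size


def checkpoints_per_log(redo_log_bytes: int, lag_bytes: int) -> int:
    """Whole number of checkpoint intervals that fit in one redo log."""
    return redo_log_bytes // lag_bytes


# Values from the text: 10,000 OS blocks of 512 bytes, 20M redo log.
lag = checkpoint_lag_bytes(10_000)
per_log = checkpoints_per_log(20 * 1024 * 1024, lag)
print(lag, per_log)
```

Rerunning the calculation whenever the redo logs are resized is one way to keep the parameter in step with the log size.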

LOG_CHECKPOINT_INTERVAL influences when a checkpoint occurs, which means

careful attention should be given to the setting of this parameter, keeping it

updated as the size of the redo log files is changed. The checkpoint

frequency is one of the factors which impacts the time required for the

database to recover from an unexpected failure. Longer intervals between

checkpoints mean that if the system crashes, more time will be needed for the

database to recover. Shorter checkpoint intervals mean that the database will

recover more quickly, at the expense of increased resource utilization during

the checkpoint operation.

This parameter also affects the time required for the roll forward phase of database recovery. The actual recovery time depends on this and on other factors, such as the type of failure (instance or system crash, media failure, etc.) and the number of archived redo logs that need to be applied.

LOG_CHECKPOINT_TIMEOUT

The LOG_CHECKPOINT_TIMEOUT parameter specifies the maximum number of seconds

the incremental checkpoint target should lag the current log tail.

In other words, it specifies how long a dirty buffer in the buffer cache can remain dirty.

Checkpoint frequency impacts the time required for the

database to recover from an unexpected failure. Longer intervals between

checkpoints mean that more time will be required during database recovery.

Oracle recommends using LOG_CHECKPOINT_INTERVAL to control the checkpoint

interval rather than LOG_CHECKPOINT_TIMEOUT, which will initiate a checkpoint

every "n" seconds, regardless of the transaction frequency. This can cause

unnecessary checkpoints in cases where transaction volumes vary. Unnecessary

checkpoints must be avoided whenever possible for optimal performance.

It is a misconception that setting LOG_CHECKPOINT_TIMEOUT to a given value will initiate a log switch at that interval, enabling a recovery window used for a stand-by database configuration. Log switches cause a checkpoint, but a checkpoint does not cause a log switch. The only ways to cause a log switch are to issue ALTER SYSTEM SWITCH LOGFILE manually or to resize the redo logs so that switches happen more frequently. Log switches are controlled by operating system blocks, not by a timed interval.

Sizing of the online redo logs is critical for performance and recovery.

See additional sections below on redo logs and checkpoints.

LOG_CHECKPOINTS_TO_ALERT

LOG_CHECKPOINTS_TO_ALERT lets you log your checkpoints to the alert file.

Doing so is useful for determining whether checkpoints are occurring at

the desired frequency.

Prior to Oracle9i this parameter was STATIC.

Oracle generally advises this be set to TRUE as the overhead is

negligible but the information in the alert log may be useful.

See Note:76713.1 for more detail on how these instance parameters can influence the checkpoint.
4. Redo logs and Checkpoint

A checkpoint occurs at every log switch. If a previous checkpoint is already

in progress, the checkpoint forced by the log switch will override the current

checkpoint.

This necessitates well-sized redo logs to avoid unnecessary checkpoints as a

result of frequent log switches.

The lag between the incremental checkpoint target and the log tail is also limited to 90% of the smallest online log file size. This ensures that, in most cases, a log switch does not need to wait for a checkpoint. For this reason, log files should be configured large enough.
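
As a rough illustration of the 90% rule, the Python sketch below computes the largest lag the smallest log allows. Taking the minimum of this limit and a LOG_CHECKPOINT_INTERVAL-derived limit is an assumption drawn from the word "also" above, not a documented formula, and the function name is hypothetical.

```python
def effective_lag_limit(log_sizes_bytes, interval_lag_bytes=None):
    """Tightest lag limit, in bytes, under the 90% rule.

    log_sizes_bytes: sizes of all online redo log files.
    interval_lag_bytes: optional limit derived from LOG_CHECKPOINT_INTERVAL
    (assumption: when both limits apply, the smaller one wins).
    """
    ninety_percent = min(log_sizes_bytes) * 9 // 10
    if interval_lag_bytes is None:
        return ninety_percent
    return min(ninety_percent, interval_lag_bytes)


three_20m_logs = [20 * 1024 * 1024] * 3
print(effective_lag_limit(three_20m_logs))             # limit from log size alone
print(effective_lag_limit(three_20m_logs, 5_120_000))  # with a 5,120,000-byte interval limit
```

Because the limit follows the smallest log, a single undersized log file tightens the limit for every log, which is one reason to keep all online logs the same size.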

A good rule of thumb is to switch logs at most every twenty minutes.

Having your log files too small can increase checkpoint activity and reduce performance.

Oracle recommends that all online log files be the same size, and that there be at least two log groups per thread. The alert log is a valuable tool for monitoring the rate at which log switches, and consequently checkpoints, occur.

The following is an example of quick log switches

from the alert log:

Fri May 16 17:15:43 1997

Thread 1 advanced to log sequence 1272

Current log# 3 seq# 1272 mem# 0: /prod1/oradata/logs/redologs03.log

Thread 1 advanced to log sequence 1273

Current log# 1 seq# 1273 mem# 0: /prod1/oradata/logs/redologs01.log

Fri May 16 17:17:25 1997

Thread 1 advanced to log sequence 1274

Current log# 2 seq# 1274 mem# 0: /prod1/oradata/logs/redologs02.log

Thread 1 advanced to log sequence 1275

Current log# 3 seq# 1275 mem# 0: /prod1/oradata/logs/redologs03.log

Fri May 16 17:20:51 1997

Thread 1 advanced to log sequence 1276

Current log# 1 seq# 1276 mem# 0: /prod1/oradata/logs/redologs01.log

If redo logs switch every 3 minutes, you will see performance degradation. This indicates the redo logs are not sized large enough to efficiently handle the transaction load, and points to increasing the size of the redolog files.
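
The switch rate in an excerpt like the one above can be measured with a small script. The following Python sketch is only an illustration: it assumes the alert log uses exactly the timestamp and "advanced to log sequence" line formats shown in the excerpt, and the function names are hypothetical.

```python
import re
from datetime import datetime

ALERT_EXCERPT = """\
Fri May 16 17:15:43 1997
Thread 1 advanced to log sequence 1272
Current log# 3 seq# 1272 mem# 0: /prod1/oradata/logs/redologs03.log
Thread 1 advanced to log sequence 1273
Current log# 1 seq# 1273 mem# 0: /prod1/oradata/logs/redologs01.log
Fri May 16 17:17:25 1997
Thread 1 advanced to log sequence 1274
Current log# 2 seq# 1274 mem# 0: /prod1/oradata/logs/redologs02.log
Thread 1 advanced to log sequence 1275
Current log# 3 seq# 1275 mem# 0: /prod1/oradata/logs/redologs03.log
Fri May 16 17:20:51 1997
Thread 1 advanced to log sequence 1276
Current log# 1 seq# 1276 mem# 0: /prod1/oradata/logs/redologs01.log
"""

STAMP_FORMAT = "%a %b %d %H:%M:%S %Y"  # assumed from the excerpt
SWITCH_RE = re.compile(r"Thread \d+ advanced to log sequence (\d+)")


def parse_switches(text):
    """Return (timestamp, sequence) pairs, dating each switch by the
    most recent timestamp line that precedes it."""
    switches, current = [], None
    for line in text.splitlines():
        line = line.strip()
        try:
            current = datetime.strptime(line, STAMP_FORMAT)
            continue
        except ValueError:
            pass  # not a timestamp line
        match = SWITCH_RE.match(line)
        if match and current is not None:
            switches.append((current, int(match.group(1))))
    return switches


def average_switch_interval_seconds(switches):
    """Elapsed seconds from first to last switch, per switch in between."""
    if len(switches) < 2:
        return None
    elapsed = (switches[-1][0] - switches[0][0]).total_seconds()
    return elapsed / (len(switches) - 1)


switches = parse_switches(ALERT_EXCERPT)
average = average_switch_interval_seconds(switches)
print(len(switches), average, average < 20 * 60)
```

The final comparison checks the measured average against the twenty-minute rule of thumb given earlier in this section.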

5. Understanding Checkpoint Error messages (“Cannot allocate new log” and “Checkpoint not
complete”)

Sometimes you will see messages like the following in your alert.log file:

Thread 1 advanced to log sequence 248

Current log# 2 seq# 248 mem# 0: /prod1/oradata/logs/redologs02.log

Thread 1 cannot allocate new log, sequence 249

Checkpoint not complete

This message indicates that Oracle wants to reuse a redo log file, but

the current checkpoint position is still in that log. In this case, Oracle must

wait until the checkpoint position passes that log. Because the

incremental checkpoint target never lags the current log tail by more than 90%

of the smallest log file size, this situation may be encountered if DBWR writes

too slowly, or if a log switch happens before the log is completely full,

or if log file sizes are too small.

When the database waits on checkpoints, redo generation is stopped until the log switch is done.
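
To see how often this happens, the alert log can be scanned for the message pair shown above. The Python sketch below is a minimal example that assumes the two messages appear on consecutive non-blank lines, as in the excerpt; the function name is hypothetical.

```python
import re

CANNOT_ALLOCATE_RE = re.compile(
    r"Thread (\d+) cannot allocate new log, sequence (\d+)"
)


def checkpoint_not_complete_events(text):
    """Return (thread, sequence) for each 'cannot allocate new log' line
    whose next non-blank line is 'Checkpoint not complete'."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    events = []
    for i, line in enumerate(lines[:-1]):
        match = CANNOT_ALLOCATE_RE.match(line)
        if match and lines[i + 1] == "Checkpoint not complete":
            events.append((int(match.group(1)), int(match.group(2))))
    return events


sample = """\
Thread 1 advanced to log sequence 248
Current log# 2 seq# 248 mem# 0: /prod1/oradata/logs/redologs02.log
Thread 1 cannot allocate new log, sequence 249
Checkpoint not complete
"""
print(checkpoint_not_complete_events(sample))
```

A growing count of such events over time suggests that DBWR throughput or redo log sizing needs attention.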

6. Oracle Release Information

In Oracle8i, the initialization parameter FAST_START_IO_TARGET causes incremental checkpointing to adjust its target automatically so that the number of data blocks needed for recovery is no more than FAST_START_IO_TARGET.

This parameter has been deprecated since Oracle 9i in favor of the FAST_START_MTTR_TARGET parameter.
7. Using Statspack to determine Checkpointing problems

Statspack snapshots can be taken every 15 minutes or so. The resulting reports gather useful information about the number of checkpoints started, the number of checkpoints completed, and the number of database buffers written during checkpointing for that window of time. They also contain statistics about redo activity. Gathering and comparing these snapshot reports gives you a complete picture of checkpointing performance at different periods of time.

Another important thing to watch in the Statspack report is the following wait events, which can be a good indication of problems with redo log throughput and checkpointing:

log file switch (checkpoint incomplete)

log file switch (archiving needed)

log file switch/archive

log file switch (clearing log file)

log file switch completion

log switch/archive

log file sync

If one or more of the above wait events occurs frequently with considerable wait values, you need to take action, such as adding more online redo log files, increasing their size, and/or modifying the checkpointing parameters.
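
As a simple aid, the wait events listed above can be filtered out of whatever per-event wait counts you have collected from a report. The Python sketch below is illustrative only: the input dictionary, its numbers, and the threshold are made up, and the sketch does not parse a real Statspack report.

```python
# Wait event names as listed in this bulletin.
CHECKPOINT_RELATED_WAITS = (
    "log file switch (checkpoint incomplete)",
    "log file switch (archiving needed)",
    "log file switch/archive",
    "log file switch (clearing log file)",
    "log file switch completion",
    "log switch/archive",
    "log file sync",
)


def flag_waits(wait_counts, min_waits=1):
    """Return the listed wait events whose count reaches min_waits."""
    return {
        name: wait_counts[name]
        for name in CHECKPOINT_RELATED_WAITS
        if wait_counts.get(name, 0) >= min_waits
    }


# Hypothetical counts for one snapshot window, for illustration only.
snapshot = {
    "log file switch (checkpoint incomplete)": 42,
    "log file sync": 3,
    "db file sequential read": 9000,
}
print(flag_waits(snapshot, min_waits=10))
```

Comparing the flagged events across successive snapshot windows shows whether a change to log size or checkpoint parameters had the intended effect.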

--------------------------------------------------------------------------------

Products

--------------------------------------------------------------------------------

Oracle Database Products > Oracle Database > Oracle Database > Oracle Server - Enterprise Edition

Keywords

--------------------------------------------------------------------------------

CHECKPOINT

Errors

--------------------------------------------------------------------------------

ERROR HANDLING
