Robert's Db2 blog: DB2 for z/OS: Running REORG to Reclaim Disk Space

Sunday, March 26, 2017

Think of why you run the DB2 for z/OS REORG utility, and a number of reasons are likely to come quickly to mind: to restore row order per a table's clustering key; to reestablish free space (for inserts and/or for updates); to remove the AREO* status set for a table space following (for example) an ALTER TABLE ADD COLUMN operation; or to materialize a pending DDL change such as an enlargement of a table space's DSSIZE. How about disk space reclamation? If that REORG motivation has not previously occurred to you, perhaps it should.

Recently, a DBA at a large DB2 for z/OS site communicated to me the success that his team has had in reclaiming substantial amounts of disk space through online reorganization of certain table spaces. He also asked for a recommendation with regard to identifying table spaces for which a REORG could potentially deliver significantly reduced disk space consumption. In this blog entry, I'll describe the disk space reclamation scenario reported by the referenced DBA, I'll explain why there was space to be reclaimed in some of the table spaces administered by the DBA, and I'll provide the "reclamation indicator" metric that I suggested to the DBA as a means of identifying table spaces that could be reorganized in order to free up disk space.

First, the scenario. At the DBA's site, there are some tables, in segmented table spaces ("traditional" segmented table spaces, as opposed to universal table spaces, which also happen to be segmented), that have these key characteristics: they are clustered by a continuously-ascending key (so that "new" rows go to the "end" of the table), and the number of inserts into the table is roughly equaled by the number of rows that are deleted from the table over a period of time.

The DB2 DBA knew that for table spaces with the above-described characteristics, REORGs were not needed to maintain "clusteredness," because of the continuously-ascending clustering key that sent new rows to the end of the table (at least, clustering would remain in good shape until the table space reached its size limit -- more on this in a moment). For the same reason, free space for inserts in "interior" pages of the table space was not a concern. Still, with DB2 real-time statistics showing a very large number of inserts since the last REORG of a couple of these table spaces, the DBA determined that reorganizations might be in order. Online REORGs of the table spaces were executed, and the result was a freeing up of 64 GB of disk space: one table space went from 21 to 4 data sets of 2 GB apiece, and the other went from 17 data sets to 2 (a DB2 segmented table space can comprise up to 32 data sets of 2 GB apiece, which is why its size limit is 64 GB).
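
The arithmetic behind the 64 GB figure can be checked with a few lines of Python. This is only a back-of-the-envelope sketch: it assumes every data set counted is a full 2 GB (in practice the last data set of a table space may be only partly used), and the function name is mine, not anything from DB2.

```python
# Figures from the scenario above: each data set of a traditional
# segmented table space holds up to 2 GB, and there can be up to 32.
DATA_SET_GB = 2
MAX_DATA_SETS = 32


def reclaimed_gb(data_sets_before, data_sets_after):
    """Approximate disk space freed (GB), assuming full 2 GB data sets."""
    return (data_sets_before - data_sets_after) * DATA_SET_GB


first = reclaimed_gb(21, 4)    # first table space: 21 -> 4 data sets
second = reclaimed_gb(17, 2)   # second table space: 17 -> 2 data sets
total = first + second
size_limit_gb = MAX_DATA_SETS * DATA_SET_GB

print(first, second, total, size_limit_gb)  # prints: 34 30 64 64
```

It is a coincidence that the space reclaimed (34 GB + 30 GB) equals the 64 GB size limit of a single segmented table space.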

Why was there so much unused space in these table spaces? Because the continuously-ascending clustering key kept pushing the "end" of the table spaces "outward." Why would that happen? Why would DB2 grow these table spaces as a result of inserts, given the like number of row-delete operations that were targeting the associated tables? Shouldn't DB2 have been using the space freed up by deletes to accommodate new inserts, without growing the table space's size? Actually, DB2 was working as designed. It's true that, given a continuously-ascending clustering key and some deletes of older rows from the "front" of a table space, DB2 can "wrap" to the front and start inserting new rows in space cleared by deletes, but that will only happen if DB2 cannot extend the table space (i.e., if DB2 cannot make the table space larger). If DB2 can extend a segmented table space, it will in order to preserve a table's row-clustering order; so, in advance of hitting the 64 GB size limit for a segmented table space, DB2 would keep making the table space larger so that it could keep adding rows to the end of a table (assuming a continuously-ascending clustering key), and deletes of older rows would result in ever-larger amounts of available-but-unused space in the table space. That's why the disk footprint of the two table spaces became so much smaller following reorganization.

[It is important to keep in mind that, given a continuously-ascending clustering key and at least some row-delete operations, DB2 will insert new rows in the "front" of a segmented table space, using space freed up by DELETEs, if the table space cannot be made any larger (either because of reaching the 64 GB limit or as a result of running into a maximum-extents or a maximum-volumes situation). In that case, "wrapping to the front" for new inserts is better than failing the inserts.]
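
To make the extend-first, wrap-only-when-full behavior concrete, here is a toy model in Python. It is a deliberate simplification and not DB2's actual space-search algorithm: it works in whole pages, each cycle inserts one page's worth of new rows and deletes one page's worth of the oldest rows, and the names (`simulate`, `max_pages`, and so on) are hypothetical.

```python
def simulate(cycles, max_pages, initial_live_pages):
    """Toy model of a table with a continuously-ascending clustering key.

    Returns (allocated_pages, live_pages, wrapped_inserts).
    """
    allocated = initial_live_pages  # pages the table space occupies on disk
    free = 0                        # pages emptied by deletes, not yet reused
    wrapped = 0                     # inserts that had to reuse freed space

    for _ in range(cycles):
        # Insert one page's worth of rows with the next-higher key values.
        if allocated < max_pages:
            allocated += 1          # extend: keeps rows in clustering order
        elif free > 0:
            free -= 1               # cannot extend: wrap to freed space
            wrapped += 1
        else:
            raise RuntimeError("table space full and no freed space")

        # Delete one page's worth of the oldest rows.
        free += 1

    live = allocated - free
    return allocated, live, wrapped


print(simulate(50, 100, 10))    # prints: (60, 10, 0)
print(simulate(200, 100, 10))   # prints: (100, 10, 110)
```

In both runs the table holds the same 10 pages of live rows throughout. In the first run the allocated size has grown six-fold because extending was always possible; in the second the table space hit its (hypothetical) 100-page limit after 90 cycles, and every insert after that reused space freed by deletes.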

Having achieved this disk space reclamation success through REORGs, the aforementioned DBA wanted to add "potential for significant disk space reclamation" to the criteria used at his site for identifying table spaces that should be reorganized (a good practice -- REORG table spaces when you have a good reason for doing so, not just "because it's been X amount of time since the last time this table space was REORGed"). How could he and his colleagues spot table spaces with large amounts of unused space? My recommendation: use for this purpose the ratio of disk space occupied by the table space to space in the table space occupied by rows in the table space. For the numerator, I'd use the SPACE value in the row for the table space in the SYSTABLESPACE catalog table. That value is updated when the STOSPACE utility is executed, so you would want to run STOSPACE on a regular basis (that should not be a big deal -- STOSPACE should be a very inexpensive utility to execute, CPU-wise). For the denominator, I would use the product of TOTALROWS from SYSTABLESPACESTATS (set by REORG and updated when INSERTs and DELETEs are executed) and AVGROWLEN in SYSTABLESPACE (updated by RUNSTATS, or by in-line statistics collected during REORG or LOAD). Note that SPACE is expressed in KB while AVGROWLEN is in bytes, so convert to a common unit before dividing. You can decide when that ratio would prompt you to run REORG to reclaim space. Would you do that when disk-space-to-row-space hits 2 (i.e., when the size of the table space is 2X the space occupied by rows)? When it hits 3? When it hits 4? One of those values might be reasonable for your environment.
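
Here is a minimal Python sketch of that ratio, assuming the three values have already been fetched from the catalog and real-time statistics tables. The function names, the default threshold of 3, and the sample table space names and numbers are all hypothetical, chosen only for illustration.

```python
def space_to_row_ratio(space_kb, total_rows, avg_row_len):
    """Ratio of allocated disk space to space occupied by rows.

    space_kb    -- SPACE from SYSTABLESPACE (kilobytes, set by STOSPACE)
    total_rows  -- TOTALROWS from SYSTABLESPACESTATS
    avg_row_len -- AVGROWLEN from SYSTABLESPACE (bytes)
    """
    row_bytes = total_rows * avg_row_len
    if row_bytes <= 0:
        return None  # no rows or no statistics yet: ratio is undefined
    return (space_kb * 1024) / row_bytes


def reorg_candidates(table_spaces, threshold=3.0):
    """Return (name, ratio) pairs at or above the threshold, largest first."""
    hits = []
    for name, space_kb, total_rows, avg_row_len in table_spaces:
        ratio = space_to_row_ratio(space_kb, total_rows, avg_row_len)
        if ratio is not None and ratio >= threshold:
            hits.append((name, ratio))
    return sorted(hits, key=lambda pair: pair[1], reverse=True)


# Hypothetical values, as if already read from the catalog and RTS tables.
sample = [
    ("DBX.TSA", 4_194_304, 5_000_000, 200),  # 4 GB allocated, ~1 GB of rows
    ("DBX.TSB", 1_048_576, 8_000_000, 100),  # 1 GB allocated, ~0.8 GB of rows
]

for name, ratio in reorg_candidates(sample):
    print(f"{name}: {ratio:.2f}")  # prints: DBX.TSA: 4.29
```

One caveat if you adapt this: SYSTABLESPACESTATS has one row per partition, so for a partitioned table space the TOTALROWS values would need to be summed (or the ratio computed partition by partition) before comparing against SPACE.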

One more thing: I have focused on traditional segmented table spaces in this blog entry because that is the table space type to which space reclamation via REORG is most relevant. For a range-partitioned table space, a given partition's size limit is determined by the DSSIZE specification, and the same is true for a partition-by-growth table space. Yes, you could see a partition-by-growth table space come to contain a high percentage of unused space given the combination of a continuously-ascending clustering key and a good deal of DELETE activity, but you could put a limit on that growth by way of a not-larger-than-needed MAXPARTITIONS value. With that said, even with range-partitioned and partition-by-growth table spaces you could see situations in which the ratio of table space size to space occupied by rows (the ratio described in the preceding paragraph) could get to be high enough to make a REORG attractive from a disk space reclamation perspective. And here there's some good news: starting with DB2 11 for z/OS, you can tell DB2 to drop partitions of a partition-by-growth table space made empty by a REORG of the entire table space (that functionality is enabled via the REORG_DROP_PBG_PARTS parameter in ZPARM).

So, add disk space reclamation to your reasons for running REORG (if you have not already done so), and consider using the ratio I've provided to look for candidate table spaces.

100 comments:

Anonymous, March 29, 2017 at 8:45 AM
Why TOTALROWS*AVGROWLEN, why not DATASIZE?

Michael Harper, TD Bank

Elvio Comunello, February 13, 2019 at 12:02 PM
Hi Robert, excuse me if this is not the right blog post.
Since migrating to V12, we have noticed that when records are inserted,
the allocated space of the table space increases considerably,
with more extents taken and a substantial decrease in the Pct of space used by ACTIVE tables.
When we reorganize, it returns to the original size.
This did not happen with V11.
The table space is a UTS PBR.

Why could this behavior have changed?

I attach the history.
Note that from the migration of DB2 to V12 on 2018-11-23,
allocated space grows much faster as the number of rows increases.
The first REORG converted the table space to RRF, which is why the allocation changed.

Part  Date/Time of Update  Space (KB)    Rows  Pct Ac  Pct Dp  Exts
----  -------------------  ----------  ------  ------  ------  ----
   2  2019-02-10-20.29.11     1617504  384299       0       0    48
   2  2019-02-03-14.12.22     1357776  375146       0       0    46
   2  2019-01-27-18.18.54     1065456  368624       1       0    37
   2  2019-01-20-16.49.38      809088  363353       1       0    32
   2  2019-01-13-15.01.41      510720  357345       2       0    24
   2  2019-01-06-13.31.03      311136  347015       3       0    19
   2  2018-12-30-18.53.03      184464  339751       5       0    15
   2  2018-12-23-16.09.50       24192  334111      43       0     6
   2  2018-12-16-17.21.36      762048  326013       1       0    35
   2  2018-12-09-17.51.33      510720  317613       2       0    29
   2  2018-12-02-18.38.16      311136  306212       3       0    20
   2  2018-11-25-17.57.04       38304  301272      25       0     7
   2  2018-11-18-15.35.31       13104  295903      73       0     3
   2  2018-11-11-17.14.28       13104  287465      71       0     3
   2  2018-11-04-16.22.33       13104  277529      87       0     3
   2  2018-10-28-17.23.37       13104  269506      67       0     5
   2  2018-10-21-16.15.35       13104  263974      65       0     5
   2  2018-10-14-17.58.13       13104  258109      64       0     5
   2  2018-10-07-16.02.30       13104  248715      87       0     5
   2  2018-09-30-14.17.34       13104  238653      59       0     3
   2  2018-09-23-16.06.47        8736  232407      87       0     3
   2  2018-09-16-14.44.28        8736  226012      84       0     3

Thanks,
Elvio

Elvio Comunello, February 15, 2019 at 4:58 AM
Thanks Robert, I'll take the problem up with the IBM Support Center.

aruna, May 12, 2023 at 2:29 AM
Rob, I ran a REORG on a list of indexes defined via LISTDEF, and the job failed because there were claimers. I saw DISPLAY claimers results in the SYSPRINT. Does REORG list the claimers, and if so, in which phase? I have never seen DISPLAY DB(*) CLAIMERS messages in any REORG job failure. Any idea?

Anonymous, July 19, 2023 at 3:29 AM
Hi Robert, I have a scheduled REORG job that fails whenever there are claimers on the object. Is there a way to skip this error?

Anonymous, August 3, 2023 at 9:05 AM
Hi Robert,
When I run REORG at the database level (the database has LOB table spaces), it reorganizes the LOB table spaces along with the base table space. As it proceeds through the other table spaces in the database and reaches the LOB table spaces that were already reorganized along with the base table space, it fails, stating that the IC data set already exists. How do I bypass this error?

Isaias, September 13, 2023 at 11:31 AM
Hi Robert... regarding LOB table spaces: does DB2 insert new rows using space cleared by deletes, as it does with UTS?

Akil, September 29, 2023 at 3:50 AM
Hello, I ran into the issue below during a REORG:
DSNT500I 270 01:46:21.45 DSNUGBAC - RESOURCE UNAVAILABLE
REASON 00C200E1
TYPE 00000220
NAME DBxxx.DSNDBC.xxx.xxxx.I0001.A001
DSNU017I 270 01:46:21.45 DSNUGBAC - UTILITY DATA BASE SERVICES MEMORY EXECUTION
CAUSE=X'00D70100'
The log gives me the info below. What does it mean?
IEC161I 069(00000008,0000271C)-162,DBxxDBM1,IEFPROC DB2UDBM1, 508
IEC161I A0022243,,,DBxx.DSNDBC.xxx.xxx.I0001.A001,,
IEC161I CATALOG.DEVDB2A
IEC161I 069(00000008,0000271C)-162,DBxxDBM1,IEFPROC DB2UDBM1, 638
IEC161I A0022246,,,DBxx.DSNDBC.xxx.xx.I0001.A001,,
IEC161I CATALOG.DEVDB2A
reorg card:
REORG TABLESPACE LIST tsx
LOG NO
NOSYSREC
SORTDATA NO
SORTDEVT SYSDA
STATISTICS TABLE(ALL) INDEX(ALL)
COPYDDN COPYTAP
TIMEOUT TERM
MAXRO 5 DRAIN_WAIT 15 RETRY 10 RETRY_DELAY 5
UNLDDN SYSREC
SHRLEVEL CHANGE FASTSWITCH YES

akil, October 11, 2023 at 8:05 AM
I encountered ABENDU0046 in a REORG. I have included SYSUT1, SYSERR, and SYSREC DD cards (with templates and LISTDEF).
DSNU1038I 281 03:42:21.45 DSNUGDYN - DATASET ALLOCATED. TEMPLATE=COPYDSN
DDNAME=SYS000xx, FILE SEQUENCE=0001
DSN=xxxxx
DSNU2904I 281 03:42:21.46 DSNURPCT - DATA RECORDS WILL BE UNLOADED VIA TABLE SPACE SCAN FROM TABLESPACE
xxxxxx
DSNU3340I 281 03:42:21.46 DSNUGSRT - UTILITY PERFORMS DYNAMIC ALLOCATION OF SORT DISK SPACE
DSNU016I 281 03:42:39.92 DSNUGBAC - UTILITY BATCH MEMORY EXECUTION ABENDED, REASON=X'0000'
I tried REGION=0M, but nothing worked.

Anonymous, October 13, 2023 at 10:39 AM
Hello, I get informational COPY-pending messages for the indexes in the REORG job output. But the table spaces in which the indexes are defined have the LOGGED attribute. Why, then, are the indexes placed in informational COPY-pending status?

Anonymous, October 21, 2023 at 1:12 AM
I have specified NOSYSREC in my REORG job, so why would it expect a SYSREC DD card?
DSNU047I 285 03:26:22.71 DSNURORG - A REQUIRED DD CARD OR TEMPLATE IS MISSING. NAME=SYSREC
DSNU2903I 285 03:26:22.72 DSNURORG - PARTITION LEVEL INLINE COPY DATASETS WILL BE ALLOCATED
DSNU012I 285 03:26:24.78 DSNUGBAC - UTILITY EXECUTION TERMINATED, HIGHEST RETURN CODE=8
REORG statement:
REORG TABLESPACE LIST TSDEF99
LOG NO
SORTDATA NO
SORTKEYS
NOSYSREC
SORTDEVT SYSDA SORTNUM 255
STATISTICS TABLE(ALL) INDEX(ALL)
KEYCARD FREQVAL NUMCOLS 1 COUNT 10
REPORT YES
UPDATE ALL
COPYDDN CPYTPRT
MAXRO 5 DRAIN_WAIT 15 RETRY 10 RETRY_DELAY 5
SHRLEVEL CHANGE FASTSWITCH YES
parallel 0

Anonymous, December 10, 2023 at 12:39 AM
Whenever I want to view the RTS for a table space, I query SYSTABLESPACESTATS. How does the data get written to or updated in SYSTABLESPACESTATS (termed "RTS")? What process is involved?

Anonymous, January 14, 2024 at 5:08 AM
Why and when do we need to include KEEPDICTIONARY in a REORG?
I see the messages below in the REORG job output. How does the COMPRESS attribute relate to KEEPDICTIONARY?
DSNU242I + 013 DSNURFUI - KEEPDICTIONARY OR COPYDICTIONARY REQUESTED BUT COMPRESS ATTRIBUTE NOT DEFINED
FOR TABLE SPACE TS, PARTITION 1
DSNU242I + 013 DSNURFUI - KEEPDICTIONARY OR COPYDICTIONARY REQUESTED BUT COMPRESS ATTRIBUTE NOT DEFINED
FOR TABLE SPACE TS, PARTITION 2

Anonymous, January 16, 2024 at 8:32 AM
I searched for information on the compression dictionary, as I had not heard or read about it. Does the compression dictionary apply only to REORG and LOAD? When does the compression dictionary get used?

Anonymous, March 19, 2024 at 4:02 AM
I submitted a part-level REORG with the copy data set specification below. In the SYSPRINT I can see data set allocation messages, image copy completed messages, and switch phase complete messages as well. But at the end of the SYSPRINT, it states that DB2 is unable to unallocate the image copy data set (which was allocated earlier in the same job), and the job failed. Why would REORG try to unallocate the IC data set that was already allocated during the inline copy process in the same job?
TEMPLATE COPY DSN '&SS..&TS..P&PA..D&MO.&DA.&YE(3,2).'
UNIT CTAPE BUFNO 60 RETPD 7 VOLCNT(99)
DISP (NEW,CATLG,CATLG) STACK NO TRTCH NOCOMP
DSNUGDYN - DATASET ALLOCATED. TEMPLATE=COPY
DDNAME=SYS00001, FILE SEQUENCE=0001
DSN="dsname"
DSNUGDYN - DATASET ALLOCATED. TEMPLATE=COPYTAP
DDNAME=SYS00002, FILE SEQUENCE=0001
DSN="dsname"
DSNURPCT - MAXIMUM UTILITY PARALLELISM IS 35 BASED
ICS

DSNURBID - COPY PROCESSED FOR TABLESPACE EMP
NUMBER OF PAGES=1906148
AVERAGE PERCENT FREE SPACE PER PAGE = 8.09
PERCENT OF CHANGED PAGES =100.00
ELAPSED TIME=04:03:27
DSNURBID - COPY PROCESSED FOR TABLESPACE EMP
NUMBER OF PAGES=2443775
AVERAGE PERCENT FREE SPACE PER PAGE = 8.27
PERCENT OF CHANGED PAGES =100.00
ELAPSED TIME=04:03:27
DSNURSWT - SWITCH PHASE COMPLETE, ELAPSED TIME = 00
DSNURSWT - DB2 IMAGE COPY SUCCESSFUL FOR TABLESPACE EMP
DSNURSWT - DB2 IMAGE COPY SUCCESSFUL FOR TABLESPACE
DSNUSUTP - SYSTABLEPART CATALOG UPDATE FOR EMP

DSNUSUTS - SYSTABLESPACE CATALOG UPDATE FOR EMP
DSNUSEF2 - RUNSTATS CATALOG TIMESTAMP = 2024-01
DSNU031I 069 12:33:39.50 DSNUGSDA - UNABLE TO UNALLOCATE "dsname"
DSNUGBAC - UTILITY BATCH MEMORY EXECUTION ABENDED

Anonymous, March 21, 2024 at 12:06 PM
I understand that an index defined with COPY YES is placed in ICOPY status when a REORG is run on the table space. I have an index in ICOPY status, but REORG was not run on that table space, so I searched for what other activities would place an index in ICOPY, with no luck.
Any thoughts?

Anonymous, May 8, 2024 at 3:00 AM
Why would a REORG expect a SYSPUNCH ddname? I don't have a SYSREC DD or UNLOAD in the REORG statement.
A REQUIRED DD CARD OR TEMPLATE IS MISSING. NAME=SYSPUNCH

Anonymous, September 26, 2024 at 3:03 AM
Per the IBM manual, the STATISTICS keyword is not applicable to LOB table spaces:
https://www.ibm.com/docs/en/db2-for-zos/12?topic=tablespace-syntax-options-reorg-control-statement

What is the reasoning behind not allowing STATISTICS for LOB table spaces?
How does that differ from running with statistics for non-LOB table spaces?

Anonymous, October 22, 2024 at 1:39 AM
Hello Robert,
I added a column (NOT NULL with a default value) at the end of a table via ALTER, but I did not see a warning message stating that objects were placed in advisory REORG state. Why so?
When I checked the status of the object, it was "AREO*".

Also, I'm surprised to see that, without a REORG even being run on the table, an UPDATE statement that set the column to a certain value in that table removed the AREO* status. How is that possible?

Anonymous, October 23, 2024 at 1:11 AM
When ALTER is applied to add a few columns, the table is placed in AREO* status (which still does not restrict access, as you said), and it is advisable to run a REORG, as the status suggests. But when I ran an UPDATE to set an integer column (NOT NULL with default '3') to zero for all existing rows, the status was removed.

Robert, October 23, 2024 at 11:43 AM
OK. Start with the reason why a REORG of the table space is advised following the ALTER TABLE action that added a column to the table (that is the meaning of AREO* status). The REORG is advised for performance reasons. The ALTER TABLE with ADD COLUMN is a so-called "immediate" change, in that the table's definition in the catalog is immediately changed when the ALTER statement is executed. That being the case, the added column is immediately "there," from the perspective of an application program. How can that be? Well, if there is an update of the newly-added column that affects, for example, one row, the new column will be physically materialized for that one row to support execution of the UPDATE statement. That "on-the-fly" materialization of the new column on an as-needed basis involves some overhead. A REORG of the table space will physically materialize the new column for all of the table's rows, eliminating the overhead of "on-the-fly" materialization for a single row (or a subset of the table's rows, depending on the scope of the UPDATE statement). When an UPDATE of a newly-added column affects all of a table's rows, the new column will be physically materialized for all of the rows, and the reason for the AREO* status (REORG in order to physically materialize the new column for all rows) therefore goes away.

Robert

Anonymous, October 25, 2024 at 7:25 AM
Makes sense!