Now, you might be thinking, "I'm quite familiar with that mechanism. It's one of the features that I most like about PBG table spaces. When partition n of a PBG table space gets full, Db2 automatically adds partition n+1, to accommodate addition row-insert operations." The thing is, if that's your understanding of how PBG partition addition works then your understanding is a bit off the mark (aside from an exception I'll get to momentarily).
Here's the straight skinny: if a PBG table space has n partitions, Db2 will add partition n+1 not when partition n is full, but when the table space is full. Consider a PBG table space with 100 partitions. Partition 100 gets full. When will partition 101 be added? Partition 101 will be added when Db2 verifies that ALL of the table space's partitions are full: Db2 will check partitions prior to partition 100 - all 99 of them, if needs be - to see if space for a new row can be found. If no space is found in any of the table space's partitions, partition 101 will be added to hold a new row.
If receiving that bit of information has you felling as though you're about to break into a cold sweat, settle down. This doesn't necessarily mean that you've made a mistake in going the PBG route for a large table. Let's consider a couple of scenarios:
- Table gets inserts, but no deletes. In this case, yes, Db2 will check all partitions of the table space before adding a new one to accommodate a new row, but that space-check will be really fast: with a quick look at the "end" of a partition (referring to the end of the data in the partition), Db2 can determine if there is space available for a new row.
- Table has a continuously-ascending clustering key. With such a clustering key in effect (every new row has a clustering key value that is greater than that of any pre-existing row), a new row should always go to the end of the table. That can make APPEND YES a viable choice for the table, and when a table in a PBG table space is defined with APPEND YES then Db2 will NOT look in all of the table space's partitions before adding a new partition to accommodate a new row: if partition n is full then partition n+1 will be added, regardless of whether or not space for the row is available in a partition other than partition n (that's the exception to the check-all-partitions rule that I mentioned up in the second paragraph of this entry). Caveat: APPEND YES can lead to an insert "hot spot" at the end of the table, and that could be an issue in a high-volume insert situation. In that case, going with MEMBER CLUSTER for the PBG table space, in addition to APPEND YES for the table, could enable achievement of the desired level of insert throughput.
What if you have a situation that is not like either of the two scenarios described above? What if you have a PBG table space that has, or is likely to eventually have, a large number of partitions? Should you be anxious about the possibly negative performance impact of an all-partition search for space that would precede dynamic addition of a new partition to a PBG table space? I think maybe not. Below I've provided some points to ponder. One or more of them may assuage your worries.
- Partition size can be a mitigating factor. You have, of course, a number of options with regard to the DSSIZE specification for a PBG table space (DSSIZE indicates the size of the table space's partitions). Why do I bring this up? Because, for a given volume of data rows, a larger DSSIZE means fewer partitions, and as a general rule the backward search for space in previous partitions that precedes the automatic adding of a new partition to accommodate more data will be accomplished more quickly if there are fewer larger partitions versus a greater number of smaller partitions. Now, before jumping to the conclusion that DSSIZE 256G (the maximum value for a PBG table space) is the way to go, consider that there can be challenges in managing larger data sets. That said, it could be that a specification of DSSIZE 32G or 64G could be a better choice than 2G or 4G for some of your larger PGB table spaces.
- Online REORG can add a partition to a PBG table space before it's needed to accommodate new rows. You may already be aware that a REORG of an entire PBG table space can result in a new partition (or partitions) being added to the end of the table space - that will happen if the reestablishment of free space (e.g., as specified through the table space's PCTFREE value) via REORG requires additional space beyond that in the table space's existing partitions. What you might not know is that Db2 12 for z/OS added a nice enhancement pertaining to REORG of a partition (or partitions) of a PBG table space. The background: prior to Db2 12, if you executed a partition-level REORG for a PBG table space, and the data in a partition would not fit in the partition with free space reestablished by REORG, the utility execution would fail (that's why the ZPARM parameter REORG_IGNORE_FREESPACE was introduced - you could set that parameter's value to YES to cause Db2 to use 0 for PCTFREE and FREEPAGE for a partition-level REORG job). With Db2 12, if a partition of a PBG table space is REORGed and the partition's data won't fit in the partition with free space reestablished, Db2 will add a partition to the table space to hold the overflow rows - the utility job won't fail (and the REORG_IGNORE_FREESPACE parameter is no longer part of the ZPARM module).
- You can "pre-allocate" some partitions at CREATE TABLESPACE time. Do you think that the NUMPARTS option of CREATE TABLESPACE is only applicable to PBR (partition-by-range) table spaces? If so, time to update your thinking. When you create a PBG table space, NUMPARTS (if specified) tells Db2 how many partitions to initially create for the table space (unless you also specified DEFINE NO in the CREATE TABLESPACE statement); so, with a few partitions available from the get-go, it could be a while before a space constraint would induce Db2 to automatically add another partition for a PBG table space.
- You can add the "next" partition yourself, with a SQL statement. If you have a situation in which a PBG is approaching "full" status and you're concerned about an "exhaustive backwards partition space search" that would precede automatic addition of a new partition, you can preclude that space search by simply adding the next partition yourself. How would you do that? With an ALTER TABLE ADD PARTITION statement, of course. Oh, you were under the impression that that statement could only be used with a PBR table space? Join the club - I thought that myself, and for quite some time after it was no longer true (ALTER TABLE ADD PARTITION became applicable for PBG table spaces starting with Db2 10 for z/OS). Anyway, you could monitor (using Db2 real-time statistics information) the full-ness of a PBG (or of the last partition of a PBG - particularly in the case of a continuously-ascending partitioning key), and if it looks as though a new partition might soon be needed, add it yourself.
OK, to sum up: if you had assumed that a new partition got added to a PBG table space when the last partition could no longer accommodate a row insert, you need to set that assumption aside (except for the previously mentioned APPEND YES scenario) and understand that an exhaustive backwards search through preceding partitions - potentially through ALL preceding partitions - for row-insert space will occur before a new partition will be automatically added to the table space. Keep in mind that in some cases (especially when there are fewer larger partitions versus lots of smaller ones, and particularly when the table in question hasn't seen row-delete action), that space search will conclude quite rapidly. When the PBG table space has a large number of partitions, and when there have been DELETE as well as INSERT operations targeting the table in the table space, the exhaustive space search preceding automatic addition of a new partition could take longer to complete. In such situations, keep in mind that REORGs (of the entire table space, or - with Db2 12 - of one or more partitions of the table space) can result in added partitions that can hold new rows. Also remember that NUMPARTS can be used to "pre-allocate" partitions at CREATE TABLESPACE time, and that ALTER TABLE ADD PARTITION is a means by which you can add a new partition to a PBG table space at a time of your choosing.
Can PBG table spaces still provide ease-of-use benefits for DBAs, even for very large (i.e., multi-partition) tables? Sure they can. It's just that they are not quite so "set-it-and-forget-it" as you might have thought. With a clear-eyed understanding of how PBG partition addition works, you can take the steps needed to make PBG table spaces work well for you.
Thanks Robert. Nicely written! Hint on APPEND YES was an eye opener which one must ensure. Could you please share your thoughts on the usage of 'REORG_DROP_PBG_PARTS'. Is it wise to use ENABLE it or when should one think of disabling the zparm.
ReplyDeleteThanks,
Ram R.
If REORGs are likely to leave one or more trailing partitions of your PBG table spaces empty, but you expect that the associated tables will "grow back into" the empty partitions, you might want to retain them (by having REORG_DROP_PBG_PARTS set to DISABLE - the default value). On the other hand, if you think that PBG partitions made empty by a REORG are likely to remain empty (always, or at least for a long time), it could be a good idea to set REORG_DROP_PBG_PARTS to ENABLE.
DeleteAnother consideration: if you set REORG_DROP_PBG_PARTS to ENABLE and one or more of a PBG table space's trailing partitions made empty by a REORG are removed, that table space cannot be recovered to a point in time prior to the dropping of the empty trailing partitions. At some sites, point-in-time (PIT) recoveries are rare, and for those organizations this might not be a big deal. At other sites, PIT recovery actions are more common and so it may be prudent for those organizations to have REORG_DROP_PBG_PARTS set to DISABLE.
Robert
Good article. I had a few of those misconceptions myself.
ReplyDeleteThank you Robert !
ReplyDeleteRobert, Good article!! Just wondering in case of LOB PBG TS DB2 will implicitly create Aux tablespace for additional partition whenever they are created and it will be random name of ts and tables . Was wondering a way to identify before hand that new partition is going to be added soon so we create needed AUX TS , Tables and associated object.? Please advise.
ReplyDeleteIf a table in a PBG table space has a LOB column, and Db2 dynamically adds a partition to the table space, Db2 will implicitly add a LOB table space and an associated auxiliary table for the new partition's LOB column. And yes, Db2 has its own mecahnism for naming the implicitly created objects. If you want to create your own LOB table spaces for future partitions of the PBG table space and name the LOB table spaces as you want, you can do this by adding the new partitions to the PBG table space using ALTER TABLE ADD PARTITION, but that only works (referring to using LOB table spaces that you create) if the LOB table space for the current last partition of the PBG table space does not yet exist (see the information under the heading, "Adding a partition for a table that is in a partition-by-growth table space and has LOB columns," on the Db2 for z/OS documentation page at https://www.ibm.com/docs/en/db2-for-zos/12?topic=statements-alter-table.
DeleteRobert.
Robert
Your articles are GOLD !!
ReplyDeleteThanks for the positive feedback!
DeleteRobert
Hello Robert.
ReplyDeleteThanks for another very helpful article
-How really painful is for Db2 look for space in a PBG, for example in 8 full 2G partitions for a table with regular INSERT, DELETES (not append) and add a new one?
Do we need to worry about it, or probably we won’t notice?
-is there a package autobind when adding or dropping (due REORG_DROP_PBG_PARTS=ENABLE) partitions?
thanks.
1) I don't have numbers for you, but generally speaking this will not be a big deal - if it were, you would likely have noticed it in your environment. If it were to become an issue for one or more of your insert processes, you could take measures mentioned in the blog entry (e.g., MEMBER CLUSTER and APPEND YES).
Delete2) No. Autobind is a result of package invalidation, and the scenario you describe is not one that leads to package invalidation (actions that result in invalidation of dependent packages are noted on this page of the Db2 for z/OS documentation: https://www.ibm.com/docs/en/db2-for-zos/13?topic=applications-changes-that-invalidate-packages). Note that the impact of autobind on applications is greatly reduced in a Db2 13 environment, thanks to autobind phase-in (see https://robertsdb2blog.blogspot.com/2023/11/db2-13-for-zos-autobind-phase-in.html).
Robert
Hi Robert,
ReplyDeleteConsider there is a PBG defined with MAXPARTITIONS 16 and table with APPEND YES. If insert is happening in 16th partition and if it becomes full ,considering table is with APPEND YES will it look for freespace in other partitions or fail with 00C9009C.
Assuming there is space in at least one partition prior to partition 16, there should be no 00C9009C error (partition full) in this case. APPEND YES does not actually mean "insert the row at the end of the table, or fail the insert." Instead, it basically means, "when inserting into this table, pay no attention to clustering." Note that APPEND YES at the table level should be paired with MEMBER CLUSTER at the table space level - this reinforces "insert anywhere - just get the insert done" behavior.
DeleteRobert
Thank you Robert
DeleteHi Robert,
DeleteThe Sql reference guide (v12-13) indicates that if I omit both MAXPARTITIONS & NUMPARTS, then I will get a PBG TSP.
A few questions on a TSP where I have omitted both MAXPARTITIONS & NUMPARTS:
a Would the created PBG be ok to use for a small table, which will never be partitioned ?
b Would the created PBG be ok to use for a medium to large table, which I may consider splitting its Reorgs/Loads
to run on partition level later on as the table grows ?
c If I opt to split the Reorgs/Loads on a PBG to run on partition level, whats your index recommendations to avoid locking & achieve
partition independence
d Whats your thoughts/recommendations (can't think of a reason why I would want to do that!) re. converting a PBG to PBR or visa versa ?
a) Yes. The small table will indeed be partitioned, as all universal table spaces are partitioned - it just won't grow beyond partition 1.
Deleteb) Yes.
c) Main thing to keep in mind is that indexes on a table in a PBG table space will always be non-partitioned. Couple of implications there: 1) even if you REORG just one partition of the table space, any indexes on the table will be REORG-ed in their entirety; and, 2) achieving full utility partition independence, especially where LOAD is concerned, requires use of partitioned indexes, and such indexes can only be defined on tables in PBR table spaces.
d) Can't convert a PBR table space to PBG in an online way (not yet, anyway - this is a known requirement and Db2 for z/OS development is working on it). As to the value of online conversion of a PBG table space to PBR, see the blog entry at https://robertsdb2blog.blogspot.com/2022/06/db2-for-zos-online-path-from-partition.html.
Robert
Hi Robert, Many thanks for the replies.
ReplyDeleteI would appreciate your thoughts on the following approach, which I am considering.
Since the PBR UTS tablespace has the most features & enhancements, & v13 will add the RPN feature, then why not always create my tablespaces to cater for PBR ?
And in the case where I just need a normal table, which does not need to be partitioned, then I can create it with 'NUMPARTS 1'.
From my reading so far, it seems if I go down this path, i'll eliminate performance problems, & will have more features available to address future issues/problems.
Do you see any problems with this approach, or am I missing something ?
That would be a highly unusual approach that I would not recommend. [Also, correction: RPN was introduced with Db2 12 - see https://robertsdb2blog.blogspot.com/2020/08/db2-12-for-zos-rpn-table-spaces-new.html.]
DeleteHere is the REALLY important thing to keep in mind: the advantages of range-partitioning a table are essentially advantages related to large tables. What is a "large table?" Different people will answer that question differently. I'd have a hard time calling a table with fewer that 1 million rows "large." I'd also likely not refer to a table with, say, 50 million rows, as "small." Basically, in my mind, a table with fewer than 1 million rows is for sure "small," and a table with more than 100 million rows is for sure "large." Between 1 million and 100 million rows there can be differences of opinion on this matter, but that's not really so important. The main point is this: in a typical Db2 for z/OS database, the large majority of tables will be on the small side - some might hold a few thousand rows, or a few tens of thousands or a few hundreds of thousands of rows. For most of these fewer-than-1-million-rows tables, I would be hard pressed to argue in favor of range-partitioning. Those tables would be best placed in PBG table spaces. I'd generally want to range-partition tables with more than 100 million rows, and I'd want those range-partitioned tables in RPN table spaces. For tables holding 1 million to 100 million rows, the PBG versus PBR decision might be on a case-by-case basis, and there's no one set of rules for making that decision - you go with what makes sense for the table in question.
Something else for you to consider: there is not a single range-partitioned table in the Db2 catalog or directory. Virtually all of the table spaces for those system tables are PBG (a few "stragglers" are still in non-universal table spaces - that situation will likely be addressed by the Db2 for z/OS development team at a future time). If there were real advantages to going with PBR for all table spaces, Db2 development would have gone that route for the catalog and directory table spaces.
Robert
Really appreciate the clarification, the tips & insights you provide are invaluable, thanks again.
ReplyDelete