Add ONLY support for VACUUM and ANALYZE

Since autovacuum does not trigger an ANALYZE for partitioned tables,
users must perform these manually.  However, performing a manual ANALYZE
on a partitioned table would always result in recursively analyzing each
partition and that could be undesirable as autovacuum takes care of that.
For partitioned tables that contain a large number of partitions, having
to analyze each partition could take an unreasonably long time, especially
so for tables with a large number of columns.

Here we allow the ONLY keyword to prefix the name of the table to allow
users to have ANALYZE skip processing partitions.  This option can also
be used with VACUUM, but there is no work to do if VACUUM ONLY is used on
a partitioned table.

This commit also changes the behavior of VACUUM	and ANALYZE for
inheritance parents.  Previously inheritance child tables would not be
processed when operating on the parent.  Now, by default we *do* operate
on the child tables.  ONLY can be used to obtain the old behavior.
The release notes should note this as an incompatibility.  The default
behavior has not changed for partitioned tables as these always
recursively processed the partitions.

Author: Michael Harris <harmic@gmail.com>
Discussion: https://postgr.es/m/CADofcAWATx_haD=QkSxHbnTsAe6+e0Aw8Eh4H8cXyogGvn_kOg@mail.gmail.com
Discussion: https://postgr.es/m/CADofcAXVbD0yGp_EaC9chmzsOoSai3jcfBCnyva3j0RRdRvMVA@mail.gmail.com
Reviewed-by: Jelte Fennema-Nio <postgres@jeltef.nl>
Reviewed-by: Melih Mutlu <m.melihmutlu@gmail.com>
Reviewed-by: Atsushi Torikoshi <torikoshia@oss.nttdata.com>
Reviewed-by: jian he <jian.universality@gmail.com>
Reviewed-by: David Rowley <dgrowleyml@gmail.com>
This commit is contained in:
David Rowley 2024-09-24 18:03:40 +12:00
parent bbba59e69a
commit 62ddf7ee9a
8 changed files with 230 additions and 45 deletions

View File

@ -3778,12 +3778,14 @@ VALUES ('Albany', NULL, NULL, 'NY');
not <literal>INSERT</literal> or <literal>ALTER TABLE ... not <literal>INSERT</literal> or <literal>ALTER TABLE ...
RENAME</literal>) typically default to including child tables and RENAME</literal>) typically default to including child tables and
support the <literal>ONLY</literal> notation to exclude them. support the <literal>ONLY</literal> notation to exclude them.
Commands that do database maintenance and tuning The majority of commands that do database maintenance and tuning
(e.g., <literal>REINDEX</literal>, <literal>VACUUM</literal>) (e.g., <literal>REINDEX</literal>) only work on individual, physical
typically only work on individual, physical tables and do not tables and do not support recursing over inheritance hierarchies.
support recursing over inheritance hierarchies. The respective However, both <literal>VACUUM</literal> and <literal>ANALYZE</literal>
behavior of each individual command is documented in its reference commands default to including child tables and the <literal>ONLY</literal>
page (<xref linkend="sql-commands"/>). notation is supported to allow them to be excluded. The respective
behavior of each individual command is documented in its reference page
(<xref linkend="sql-commands"/>).
</para> </para>
<para> <para>
@ -4854,11 +4856,12 @@ ALTER TABLE measurement_y2008m02 INHERIT measurement;
<listitem> <listitem>
<para> <para>
If you are using manual <command>VACUUM</command> or Manual <command>VACUUM</command> and <command>ANALYZE</command>
<command>ANALYZE</command> commands, don't forget that commands will automatically process all inheritance child tables. If
you need to run them on each child table individually. A command like: this is undesirable, you can use the <literal>ONLY</literal> keyword.
A command like:
<programlisting> <programlisting>
ANALYZE measurement; ANALYZE ONLY measurement;
</programlisting> </programlisting>
will only process the root table. will only process the root table.
</para> </para>

View File

@ -5525,9 +5525,9 @@ FROM pg_stat_get_backend_idset() AS backendid;
<note> <note>
<para> <para>
Note that when <command>ANALYZE</command> is run on a partitioned table, Note that when <command>ANALYZE</command> is run on a partitioned table
all of its partitions are also recursively analyzed. without the <literal>ONLY</literal> keyword, all of its partitions are
In that case, <command>ANALYZE</command> also recursively analyzed. In that case, <command>ANALYZE</command>
progress is reported first for the parent table, whereby its inheritance progress is reported first for the parent table, whereby its inheritance
statistics are collected, followed by that for each partition. statistics are collected, followed by that for each partition.
</para> </para>

View File

@ -31,7 +31,7 @@ ANALYZE [ ( <replaceable class="parameter">option</replaceable> [, ...] ) ] [ <r
<phrase>and <replaceable class="parameter">table_and_columns</replaceable> is:</phrase> <phrase>and <replaceable class="parameter">table_and_columns</replaceable> is:</phrase>
<replaceable class="parameter">table_name</replaceable> [ ( <replaceable class="parameter">column_name</replaceable> [, ...] ) ] [ ONLY ] <replaceable class="parameter">table_name</replaceable> [ * ] [ ( <replaceable class="parameter">column_name</replaceable> [, ...] ) ]
</synopsis> </synopsis>
</refsynopsisdiv> </refsynopsisdiv>
@ -142,9 +142,12 @@ ANALYZE [ ( <replaceable class="parameter">option</replaceable> [, ...] ) ] [ <r
The name (possibly schema-qualified) of a specific table to The name (possibly schema-qualified) of a specific table to
analyze. If omitted, all regular tables, partitioned tables, and analyze. If omitted, all regular tables, partitioned tables, and
materialized views in the current database are analyzed (but not materialized views in the current database are analyzed (but not
foreign tables). If the specified table is a partitioned table, both the foreign tables). If <literal>ONLY</literal> is specified before
inheritance statistics of the partitioned table as a whole and the table name, only that table is analyzed. If <literal>ONLY</literal>
statistics of the individual partitions are updated. is not specified, the table and all its inheritance child tables or
partitions (if any) are analyzed. Optionally, <literal>*</literal>
can be specified after the table name to explicitly indicate that
inheritance child tables (or partitions) are to be analyzed.
</para> </para>
</listitem> </listitem>
</varlistentry> </varlistentry>
@ -284,22 +287,23 @@ ANALYZE [ ( <replaceable class="parameter">option</replaceable> [, ...] ) ] [ <r
<command>ANALYZE</command> gathers two sets of statistics: one on the rows <command>ANALYZE</command> gathers two sets of statistics: one on the rows
of the parent table only, and a second including rows of both the parent of the parent table only, and a second including rows of both the parent
table and all of its children. This second set of statistics is needed when table and all of its children. This second set of statistics is needed when
planning queries that process the inheritance tree as a whole. The child planning queries that process the inheritance tree as a whole. The
tables themselves are not individually analyzed in this case. autovacuum daemon, however, will only consider inserts or updates on the
The autovacuum daemon, however, will only consider inserts or parent table itself when deciding whether to trigger an automatic analyze
updates on the parent table itself when deciding whether to trigger an for that table. If that table is rarely inserted into or updated, the
automatic analyze for that table. If that table is rarely inserted into inheritance statistics will not be up to date unless you run
or updated, the inheritance statistics will not be up to date unless you <command>ANALYZE</command> manually. By default,
run <command>ANALYZE</command> manually. <command>ANALYZE</command> will also recursively collect and update the
statistics for each inheritance child table. The <literal>ONLY</literal>
keyword may be used to disable this.
</para> </para>
<para> <para>
For partitioned tables, <command>ANALYZE</command> gathers statistics by For partitioned tables, <command>ANALYZE</command> gathers statistics by
sampling rows from all partitions; in addition, it will recurse into each sampling rows from all partitions. By default,
partition and update its statistics. Each leaf partition is analyzed only <command>ANALYZE</command> will also recursively collect and update the
once, even with multi-level partitioning. No statistics are collected for statistics for each partition. The <literal>ONLY</literal> keyword may be
only the parent table (without data from its partitions), because with used to disable this.
partitioning it's guaranteed to be empty.
</para> </para>
<para> <para>

View File

@ -42,7 +42,7 @@ VACUUM [ ( <replaceable class="parameter">option</replaceable> [, ...] ) ] [ <re
<phrase>and <replaceable class="parameter">table_and_columns</replaceable> is:</phrase> <phrase>and <replaceable class="parameter">table_and_columns</replaceable> is:</phrase>
<replaceable class="parameter">table_name</replaceable> [ ( <replaceable class="parameter">column_name</replaceable> [, ...] ) ] [ ONLY ] <replaceable class="parameter">table_name</replaceable> [ * ] [ ( <replaceable class="parameter">column_name</replaceable> [, ...] ) ]
</synopsis> </synopsis>
</refsynopsisdiv> </refsynopsisdiv>
@ -401,8 +401,13 @@ VACUUM [ ( <replaceable class="parameter">option</replaceable> [, ...] ) ] [ <re
<listitem> <listitem>
<para> <para>
The name (optionally schema-qualified) of a specific table or The name (optionally schema-qualified) of a specific table or
materialized view to vacuum. If the specified table is a partitioned materialized view to vacuum. If <literal>ONLY</literal> is specified
table, all of its leaf partitions are vacuumed. before the table name, only that table is vacuumed. If
<literal>ONLY</literal> is not specified, the table and all its
inheritance child tables or partitions (if any) are also vacuumed.
Optionally, <literal>*</literal> can be specified after the table name
to explicitly indicate that inheritance child tables (or partitions) are
to be vacuumed.
</para> </para>
</listitem> </listitem>
</varlistentry> </varlistentry>

View File

@ -851,7 +851,7 @@ vacuum_open_relation(Oid relid, RangeVar *relation, bits32 options,
/* /*
* Given a VacuumRelation, fill in the table OID if it wasn't specified, * Given a VacuumRelation, fill in the table OID if it wasn't specified,
* and optionally add VacuumRelations for partitions of the table. * and optionally add VacuumRelations for partitions or inheritance children.
* *
* If a VacuumRelation does not have an OID supplied and is a partitioned * If a VacuumRelation does not have an OID supplied and is a partitioned
* table, an extra entry will be added to the output for each partition. * table, an extra entry will be added to the output for each partition.
@ -879,11 +879,15 @@ expand_vacuum_rel(VacuumRelation *vrel, MemoryContext vac_context,
} }
else else
{ {
/* Process a specific relation, and possibly partitions thereof */ /*
* Process a specific relation, and possibly partitions or child
* tables thereof.
*/
Oid relid; Oid relid;
HeapTuple tuple; HeapTuple tuple;
Form_pg_class classForm; Form_pg_class classForm;
bool include_parts; bool include_children;
bool is_partitioned_table;
int rvr_opts; int rvr_opts;
/* /*
@ -944,20 +948,31 @@ expand_vacuum_rel(VacuumRelation *vrel, MemoryContext vac_context,
MemoryContextSwitchTo(oldcontext); MemoryContextSwitchTo(oldcontext);
} }
/*
* Vacuuming a partitioned table with ONLY will not do anything since
* the partitioned table itself is empty. Issue a warning if the user
* requests this.
*/
include_children = vrel->relation->inh;
is_partitioned_table = (classForm->relkind == RELKIND_PARTITIONED_TABLE);
if ((options & VACOPT_VACUUM) && is_partitioned_table && !include_children)
ereport(WARNING,
(errmsg("VACUUM ONLY of partitioned table \"%s\" has no effect",
vrel->relation->relname)));
include_parts = (classForm->relkind == RELKIND_PARTITIONED_TABLE);
ReleaseSysCache(tuple); ReleaseSysCache(tuple);
/* /*
* If it is, make relation list entries for its partitions. Note that * Unless the user has specified ONLY, make relation list entries for
* the list returned by find_all_inheritors() includes the passed-in * its partitions or inheritance child tables. Note that the list
* OID, so we have to skip that. There's no point in taking locks on * returned by find_all_inheritors() includes the passed-in OID, so we
* the individual partitions yet, and doing so would just add * have to skip that. There's no point in taking locks on the
* unnecessary deadlock risk. For this last reason we do not check * individual partitions or child tables yet, and doing so would just
* yet the ownership of the partitions, which get added to the list to * add unnecessary deadlock risk. For this last reason, we do not yet
* process. Ownership will be checked later on anyway. * check the ownership of the partitions/tables, which get added to
* the list to process. Ownership will be checked later on anyway.
*/ */
if (include_parts) if (include_children)
{ {
List *part_oids = find_all_inheritors(relid, NoLock, NULL); List *part_oids = find_all_inheritors(relid, NoLock, NULL);
ListCell *part_lc; ListCell *part_lc;

View File

@ -11921,7 +11921,7 @@ opt_name_list:
; ;
vacuum_relation: vacuum_relation:
qualified_name opt_name_list relation_expr opt_name_list
{ {
$$ = (Node *) makeVacuumRelation($1, InvalidOid, $2); $$ = (Node *) makeVacuumRelation($1, InvalidOid, $2);
} }

View File

@ -291,6 +291,98 @@ ANALYZE vactst, vactst;
BEGIN; -- ANALYZE behaves differently inside a transaction block BEGIN; -- ANALYZE behaves differently inside a transaction block
ANALYZE vactst, vactst; ANALYZE vactst, vactst;
COMMIT; COMMIT;
--
-- Tests for ANALYZE ONLY / VACUUM ONLY on partitioned tables
--
CREATE TABLE only_parted (a int, b text) PARTITION BY LIST (a);
CREATE TABLE only_parted1 PARTITION OF only_parted FOR VALUES IN (1);
INSERT INTO only_parted VALUES (1, 'a');
-- Ensure only the partitioned table is analyzed
ANALYZE ONLY only_parted;
SELECT relname, last_analyze IS NOT NULL AS analyzed, last_vacuum IS NOT NULL AS vacuumed
FROM pg_stat_user_tables
WHERE relid IN ('only_parted'::regclass, 'only_parted1'::regclass)
ORDER BY relname;
relname | analyzed | vacuumed
--------------+----------+----------
only_parted | t | f
only_parted1 | f | f
(2 rows)
-- Ensure partitioned table and the partitions are analyzed
ANALYZE only_parted;
SELECT relname, last_analyze IS NOT NULL AS analyzed, last_vacuum IS NOT NULL AS vacuumed
FROM pg_stat_user_tables
WHERE relid IN ('only_parted'::regclass, 'only_parted1'::regclass)
ORDER BY relname;
relname | analyzed | vacuumed
--------------+----------+----------
only_parted | t | f
only_parted1 | t | f
(2 rows)
DROP TABLE only_parted;
-- VACUUM ONLY on a partitioned table does nothing, ensure we get a warning.
VACUUM ONLY vacparted;
WARNING: VACUUM ONLY of partitioned table "vacparted" has no effect
-- Try ANALYZE ONLY with a column list
ANALYZE ONLY vacparted(a,b);
--
-- Tests for VACUUM ONLY / ANALYZE ONLY on inheritance tables
--
CREATE TABLE only_inh_parent (a int primary key, b TEXT);
CREATE TABLE only_inh_child () INHERITS (only_inh_parent);
INSERT INTO only_inh_child(a,b) VALUES (1, 'aaa'), (2, 'bbb'), (3, 'ccc');
-- Ensure only parent is analyzed
ANALYZE ONLY only_inh_parent;
SELECT relname, last_analyze IS NOT NULL AS analyzed, last_vacuum IS NOT NULL AS vacuumed
FROM pg_stat_user_tables
WHERE relid IN ('only_inh_parent'::regclass, 'only_inh_child'::regclass)
ORDER BY relname;
relname | analyzed | vacuumed
-----------------+----------+----------
only_inh_child | f | f
only_inh_parent | t | f
(2 rows)
-- Ensure the parent and child are analyzed
ANALYZE only_inh_parent;
SELECT relname, last_analyze IS NOT NULL AS analyzed, last_vacuum IS NOT NULL AS vacuumed
FROM pg_stat_user_tables
WHERE relid IN ('only_inh_parent'::regclass, 'only_inh_child'::regclass)
ORDER BY relname;
relname | analyzed | vacuumed
-----------------+----------+----------
only_inh_child | t | f
only_inh_parent | t | f
(2 rows)
-- Ensure only the parent is vacuumed
VACUUM ONLY only_inh_parent;
SELECT relname, last_analyze IS NOT NULL AS analyzed, last_vacuum IS NOT NULL AS vacuumed
FROM pg_stat_user_tables
WHERE relid IN ('only_inh_parent'::regclass, 'only_inh_child'::regclass)
ORDER BY relname;
relname | analyzed | vacuumed
-----------------+----------+----------
only_inh_child | t | f
only_inh_parent | t | t
(2 rows)
-- Ensure parent and child are vacuumed
VACUUM only_inh_parent;
SELECT relname, last_analyze IS NOT NULL AS analyzed, last_vacuum IS NOT NULL AS vacuumed
FROM pg_stat_user_tables
WHERE relid IN ('only_inh_parent'::regclass, 'only_inh_child'::regclass)
ORDER BY relname;
relname | analyzed | vacuumed
-----------------+----------+----------
only_inh_child | t | t
only_inh_parent | t | t
(2 rows)
DROP TABLE only_inh_parent CASCADE;
NOTICE: drop cascades to table only_inh_child
-- parenthesized syntax for ANALYZE -- parenthesized syntax for ANALYZE
ANALYZE (VERBOSE) does_not_exist; ANALYZE (VERBOSE) does_not_exist;
ERROR: relation "does_not_exist" does not exist ERROR: relation "does_not_exist" does not exist

View File

@ -233,6 +233,72 @@ BEGIN; -- ANALYZE behaves differently inside a transaction block
ANALYZE vactst, vactst; ANALYZE vactst, vactst;
COMMIT; COMMIT;
--
-- Tests for ANALYZE ONLY / VACUUM ONLY on partitioned tables
--
CREATE TABLE only_parted (a int, b text) PARTITION BY LIST (a);
CREATE TABLE only_parted1 PARTITION OF only_parted FOR VALUES IN (1);
INSERT INTO only_parted VALUES (1, 'a');
-- Ensure only the partitioned table is analyzed
ANALYZE ONLY only_parted;
SELECT relname, last_analyze IS NOT NULL AS analyzed, last_vacuum IS NOT NULL AS vacuumed
FROM pg_stat_user_tables
WHERE relid IN ('only_parted'::regclass, 'only_parted1'::regclass)
ORDER BY relname;
-- Ensure partitioned table and the partitions are analyzed
ANALYZE only_parted;
SELECT relname, last_analyze IS NOT NULL AS analyzed, last_vacuum IS NOT NULL AS vacuumed
FROM pg_stat_user_tables
WHERE relid IN ('only_parted'::regclass, 'only_parted1'::regclass)
ORDER BY relname;
DROP TABLE only_parted;
-- VACUUM ONLY on a partitioned table does nothing, ensure we get a warning.
VACUUM ONLY vacparted;
-- Try ANALYZE ONLY with a column list
ANALYZE ONLY vacparted(a,b);
--
-- Tests for VACUUM ONLY / ANALYZE ONLY on inheritance tables
--
CREATE TABLE only_inh_parent (a int primary key, b TEXT);
CREATE TABLE only_inh_child () INHERITS (only_inh_parent);
INSERT INTO only_inh_child(a,b) VALUES (1, 'aaa'), (2, 'bbb'), (3, 'ccc');
-- Ensure only parent is analyzed
ANALYZE ONLY only_inh_parent;
SELECT relname, last_analyze IS NOT NULL AS analyzed, last_vacuum IS NOT NULL AS vacuumed
FROM pg_stat_user_tables
WHERE relid IN ('only_inh_parent'::regclass, 'only_inh_child'::regclass)
ORDER BY relname;
-- Ensure the parent and child are analyzed
ANALYZE only_inh_parent;
SELECT relname, last_analyze IS NOT NULL AS analyzed, last_vacuum IS NOT NULL AS vacuumed
FROM pg_stat_user_tables
WHERE relid IN ('only_inh_parent'::regclass, 'only_inh_child'::regclass)
ORDER BY relname;
-- Ensure only the parent is vacuumed
VACUUM ONLY only_inh_parent;
SELECT relname, last_analyze IS NOT NULL AS analyzed, last_vacuum IS NOT NULL AS vacuumed
FROM pg_stat_user_tables
WHERE relid IN ('only_inh_parent'::regclass, 'only_inh_child'::regclass)
ORDER BY relname;
-- Ensure parent and child are vacuumed
VACUUM only_inh_parent;
SELECT relname, last_analyze IS NOT NULL AS analyzed, last_vacuum IS NOT NULL AS vacuumed
FROM pg_stat_user_tables
WHERE relid IN ('only_inh_parent'::regclass, 'only_inh_child'::regclass)
ORDER BY relname;
DROP TABLE only_inh_parent CASCADE;
-- parenthesized syntax for ANALYZE -- parenthesized syntax for ANALYZE
ANALYZE (VERBOSE) does_not_exist; ANALYZE (VERBOSE) does_not_exist;
ANALYZE (nonexistent-arg) does_not_exist; ANALYZE (nonexistent-arg) does_not_exist;