postgres

Go to file

Peter Geoghegan 9dd963ae25 Recycle nbtree pages deleted during same VACUUM.

Maintain a simple array of metadata about pages that were deleted during
nbtree VACUUM's current btvacuumscan() call. Use this metadata at the
end of btvacuumscan() to attempt to place newly deleted pages in the FSM
without further delay. It might not yet be safe to place any of the
pages in the FSM by then (they may not be deemed recyclable), but we
have little to lose and plenty to gain by trying. In practice there is
a very good chance that this will work out when vacuuming larger
indexes, where scanning the index naturally takes quite a while.

This commit doesn't change the page recycling invariants; it merely
improves the efficiency of page recycling within the confines of the
existing design. Recycle safety is a part of nbtree's implementation of
what Lanin & Shasha call "the drain technique". The design happens to
use transaction IDs (they're stored in deleted pages), but that in
itself doesn't align the cutoff for recycle safety to any of the
XID-based cutoffs used by VACUUM (e.g., OldestXmin). All that matters
is whether or not _other_ backends might be able to observe various
inconsistencies in the tree structure (that they cannot just detect and
recover from by moving right). Recycle safety is purely a question of
maintaining the consistency (or the apparent consistency) of a physical
data structure.

Note that running a simple serial test case involving a large range
DELETE followed by a VACUUM VERBOSE will probably show that any newly
deleted nbtree pages are not yet reusable/recyclable. This is expected
in the absence of even one concurrent XID assignment. It is an old
implementation restriction. In practice it's unlikely to be the thing
that makes recycling remain unsafe, at least with larger indexes, where
recycling newly deleted pages during the same VACUUM actually matters.

An important high-level goal of this commit (as well as related recent
commits e5d8a999 and 9f3665fb) is to make expensive deferred cleanup
operations in index AMs rare in general. If index vacuuming frequently
depends on the next VACUUM operation finishing off work that the current
operation started, then the general behavior of index vacuuming is hard
to predict. This is relevant to ongoing work that adds a vacuumlazy.c
mechanism to skip index vacuuming in certain cases. Anything that makes
the real world behavior of index vacuuming simpler and more linear will
also make top-down modeling in vacuumlazy.c more robust.

Author: Peter Geoghegan <pg@bowt.ie>
Reviewed-By: Masahiko Sawada <sawada.mshk@gmail.com>
Discussion: https://postgr.es/m/CAH2-Wzk76_P=67iUscb1UN44-gyZL-KgpsXbSxq_bdcMa7Q+wQ@mail.gmail.com

2021-03-21 15:25:39 -07:00

config

Update copyright for 2021

2021-01-02 13:06:25 -05:00

contrib

Allow configurable LZ4 TOAST compression.

2021-03-19 15:10:38 -04:00

doc

Provide recovery_init_sync_method=syncfs.

2021-03-20 12:07:28 +13:00

src

Recycle nbtree pages deleted during same VACUUM.

2021-03-21 15:25:39 -07:00

.dir-locals.el

Make Emacs perl-mode indent more like perltidy.

2019-01-13 11:32:31 -08:00

.editorconfig

Add .editorconfig

2019-12-18 09:13:13 +01:00

.gitattributes

gitattributes: Add new file

2019-11-12 08:13:55 +01:00

.gitignore

Support for optimizing and emitting code in LLVM JIT provider.

2018-03-22 11:05:22 -07:00

aclocal.m4

Remove configure-time probe for DocBook DTD.

2020-11-30 15:24:13 -05:00

configure

Bring configure support for LZ4 up to snuff.

2021-03-21 17:20:17 -04:00

configure.ac

Bring configure support for LZ4 up to snuff.

2021-03-21 17:20:17 -04:00

Update copyright for 2021

2021-01-02 13:06:25 -05:00

GNUmakefile.in

Make install-tests target work with vpath builds

2020-05-31 18:33:00 -04:00

HISTORY

Canonicalize some URLs

2020-02-10 20:47:50 +01:00

Makefile

Don't unset MAKEFLAGS in non-GNU Makefile.

2019-06-25 09:36:21 +12:00

README

Canonicalize some URLs

2020-02-10 20:47:50 +01:00

README.git

Canonicalize some URLs

2020-02-10 20:47:50 +01:00

README

PostgreSQL Database Management System
=====================================

This directory contains the source code distribution of the PostgreSQL
database management system.

PostgreSQL is an advanced object-relational database management system
that supports an extended subset of the SQL standard, including
transactions, foreign keys, subqueries, triggers, user-defined types
and functions.  This distribution also contains C language bindings.

PostgreSQL has many language interfaces, many of which are listed here:

	https://www.postgresql.org/download/

See the file INSTALL for instructions on how to build and install
PostgreSQL.  That file also lists supported operating systems and
hardware platforms and contains information regarding any other
software packages that are required to build or run the PostgreSQL
system.  Copyright and license information can be found in the
file COPYRIGHT.  A comprehensive documentation set is included in this
distribution; it can be read as described in the installation
instructions.

The latest version of this software may be obtained at
https://www.postgresql.org/download/.  For more information look at our
web site located at https://www.postgresql.org/.

Languages

C 85.7%

PLpgSQL 5.8%

Perl 4.1%

Yacc 1.3%

Makefile 0.7%

Other 2.3%