mirror of https://github.com/bitcoin/bitcoin.git synced 2026-02-16 18:39:18 +00:00

Merge bitcoin/bitcoin#24858 : incorrect blk file size calculation during reindex results in recoverable blk file corruption

bcb0cacac28e98a39dc856c574a0872fe17059e9 reindex, log, test: fixes #21379 (mruddy)

Pull request description:

  Fixes #21379.

  The blocks/blk?????.dat files are mutated and become increasingly malformed, or corrupt, as a result of running the re-indexing process.
  The mutations occur after the re-indexing process has finished, as new blocks are appended, but are a result of a re-indexing process miscalculation that lingers in the block manager's `m_blockfile_info` `nSize` data until node restart.
  These additions to the blk files are non-fatal, but also not desirable.
  That is, this is a form of data corruption that the reading code is lenient enough to process (it skips the extra bytes), but it adds some scary looking log messages as it encounters them.

  The summary of the problem is that the re-index process double counts the size of the serialization header (magic message start bytes [4 bytes] + length [4 bytes] = 8 bytes) while calculating the blk data file size. Both values used in the calculation already account for the serialization header's size, so the header is counted twice.
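
The double count can be illustrated with a small sketch. The names below (`kHeaderSize`, `PayloadOffset`, `EndOffsetBuggy`, `EndOffsetFixed`, `OverCount`) are hypothetical and are not Bitcoin Core's actual functions. The sketch also assumes that the block position known during reindex is the offset of the block's payload, so it already lies past that block's 8-byte header.

```cpp
#include <cassert>
#include <cstdint>

// Serialization header written before every block in a blk file:
// 4 magic message start bytes + 4 length bytes.
constexpr std::uint64_t kHeaderSize = 4 + 4;

// Assumption of this sketch: the known position of a block is the offset of
// its payload, i.e. it already includes that block's 8-byte header.
constexpr std::uint64_t PayloadOffset(std::uint64_t record_start)
{
    return record_start + kHeaderSize;
}

// Over-counted: adds the header again on top of an offset that includes it.
constexpr std::uint64_t EndOffsetBuggy(std::uint64_t payload_offset, std::uint64_t block_size)
{
    return payload_offset + block_size + kHeaderSize;
}

// Corrected: the record ends where the payload ends.
constexpr std::uint64_t EndOffsetFixed(std::uint64_t payload_offset, std::uint64_t block_size)
{
    return payload_offset + block_size;
}

// Returns how many bytes the buggy calculation over-counts for one block.
inline std::uint64_t OverCount(std::uint64_t record_start, std::uint64_t block_size)
{
    const std::uint64_t payload = PayloadOffset(record_start);
    const std::uint64_t excess = EndOffsetBuggy(payload, block_size) - EndOffsetFixed(payload, block_size);
    assert(excess == kHeaderSize); // always exactly one header
    return excess;
}

// A 100-byte block stored at the very start of a file occupies bytes [0, 108).
static_assert(EndOffsetFixed(PayloadOffset(0), 100) == 108, "correct end offset");
static_assert(EndOffsetBuggy(PayloadOffset(0), 100) == 116, "8 bytes too many");
```

Under these assumptions the excess is independent of the block size and of where the block sits in the file, which matches the constant 8-byte discrepancy described here.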

  This bug manifests itself in a few different ways, after re-indexing, when a new block from a peer is processed:
  1. If the new block will not fit into the last blk file processed while re-indexing, while remaining under the 128 MiB limit, then the blk file is flushed to disk and truncated to a size that is 8 bytes greater than it should be. The truncation pads the file with zero-valued bytes (see `FlatFileSeq::Flush` and `TruncateFile`).
  1. If the last blk file processed while re-indexing has logical space for the new block under the 128 MiB limit:
      1. If the blk file was not already large enough to hold the new block, then the zeros are, in effect, added by `fseek` when the file is opened for writing. Eight zero bytes are added to the end of the last blk file just before the new block is written. This happens because the write offset is 8 too great due to the miscalculation. The result is 8 zero bytes between the end of the last block and the beginning of the next block's magic + length + block.
      1. If the blk file was already large enough to hold the new block, then the existing file contents remain in the 8 byte gap between the end of the last block and the beginning of the next block's magic + length + block. Commonly, when this occurs, it is due to the blk file containing blocks that are not connected to the block tree during reindex and are thus left behind by the reindex process and later overwritten when new blocks are added. The orphaned blocks can be valid blocks, but due to the nature of concurrent block download, the parent may not have been retrieved and written by the time the node was previously shut down.
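
The zero-filled gap case above, and the lenient reading mentioned earlier, can be reproduced with a toy in-memory model. This is a sketch under stated assumptions, not Bitcoin Core code: `WriteRecordAt`, `LenientScan`, `Demo`, and the magic value `kMagic` are hypothetical, the length field is assumed to be little-endian, and a `std::vector` stands in for the blk file.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

using Bytes = std::vector<std::uint8_t>;

// Placeholder magic bytes for illustration only (not a real network's value).
const Bytes kMagic{0xAA, 0xBB, 0xCC, 0xDD};

// Write magic + 4-byte length + payload at `offset`, growing the file with
// zero bytes if the offset lies past its current end.
void WriteRecordAt(Bytes& file, std::size_t offset, const Bytes& payload)
{
    const std::size_t end = offset + 8 + payload.size();
    if (file.size() < end) file.resize(end, 0);
    std::size_t pos = offset;
    for (std::uint8_t b : kMagic) file[pos++] = b;
    const std::uint32_t len = static_cast<std::uint32_t>(payload.size());
    for (int i = 0; i < 4; ++i) file[pos++] = static_cast<std::uint8_t>(len >> (8 * i));
    for (std::uint8_t b : payload) file[pos++] = b;
}

struct ScanResult {
    std::size_t records = 0;       // complete records found
    std::size_t skipped_bytes = 0; // bytes passed over while searching for magic
};

// Lenient reader: advance byte by byte until the magic is found, then use the
// length field to jump over the payload.
ScanResult LenientScan(const Bytes& file)
{
    ScanResult result;
    std::size_t pos = 0;
    while (pos + 8 <= file.size()) {
        bool is_magic = true;
        for (std::size_t i = 0; i < 4; ++i) is_magic = is_magic && file[pos + i] == kMagic[i];
        if (!is_magic) {
            ++pos;
            ++result.skipped_bytes;
            continue;
        }
        std::uint32_t len = 0;
        for (int i = 0; i < 4; ++i) len |= static_cast<std::uint32_t>(file[pos + 4 + i]) << (8 * i);
        if (pos + 8 + len > file.size()) break; // truncated record
        ++result.records;
        pos += 8 + len;
    }
    return result;
}

// Append two 5-byte payloads; the second is written 8 bytes too far, as if
// the tracked file size had been over-counted by one header.
ScanResult Demo()
{
    Bytes file;
    const Bytes payload(5, 0x01);
    WriteRecordAt(file, 0, payload);                // occupies [0, 13)
    const std::size_t stale_size = file.size() + 8; // miscalculated write offset
    WriteRecordAt(file, stale_size, payload);       // occupies [21, 34)
    return LenientScan(file);
}
```

In this model the scan still finds both records and passes over the 8 gap bytes between them, which is the recoverable kind of corruption the description refers to.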

ACKs for top commit:
  LarryRuane:
    tested code-review ACK bcb0cacac28e98a39dc856c574a0872fe17059e9
  ryanofsky:
    Code review ACK bcb0cacac28e98a39dc856c574a0872fe17059e9. This is a disturbing bug with an easy fix which seems well-worth merging.
  mzumsande:
    ACK bcb0cacac28e98a39dc856c574a0872fe17059e9 (reviewed code and did some testing, I agree that it fixes the bug).
  w0xlt:
    tACK bcb0cacac2

Tree-SHA512: acc97927ea712916506772550451136b0f1e5404e92df24cc05e405bb09eb6fe7c3011af3dd34a7723c3db17fda657ae85fa314387e43833791e9169c0febe51

2022-10-12 14:13:54 -04:00

data

scripted-diff: Regenerate key_io data deterministically

2022-04-06 17:08:07 +02:00

fuzz

fuzz: pass max fee into ConsumeTxMemPoolEntry

2022-10-04 21:12:50 +01:00

util

Merge bitcoin/bitcoin#26036 : net: add NetEventsInterface::g_msgproc_mutex

2022-09-20 14:18:23 +01:00

addrman_tests.cpp

addrman: Use system time instead of adjusted network time

2022-07-30 11:04:09 +02:00

allocator_tests.cpp

…

amount_tests.cpp

…

arith_uint256_tests.cpp

…

banman_tests.cpp

test: Use proper Boost macros instead of assertions

2022-10-03 00:00:31 +01:00

base32_tests.cpp

Make DecodeBase{32,64} return optional instead of taking bool*

2022-04-27 14:12:55 +02:00

base58_tests.cpp

refactor: Make const refs vars where applicable

2022-07-27 13:27:57 +02:00

base64_tests.cpp

Make DecodeBase{32,64} return optional instead of taking bool*

2022-04-27 14:12:55 +02:00

bech32_tests.cpp

…

bip32_tests.cpp

extended keys: fail to derive too large depth instead of wrapping around

2022-08-04 11:32:26 +02:00

blockchain_tests.cpp

refactor: use <cstdlib> over stdlib.h

2022-09-23 10:48:47 +01:00

blockencodings_tests.cpp

scripted-diff: test: Use CTxMemPool in TestingSetup

2022-06-15 17:28:55 -04:00

blockfilter_index_tests.cpp

Require callers of AcceptBlockHeader() to perform anti-dos checks

2022-08-29 08:10:35 -04:00

blockfilter_tests.cpp

refactor: Make const refs vars where applicable

2022-07-27 13:27:57 +02:00

blockmanager_tests.cpp

reindex, log, test: fixes #21379

2022-05-07 07:11:29 -04:00

bloom_tests.cpp

…

bswap_tests.cpp

…

checkqueue_tests.cpp

refactor: use C++11 default initializers

2022-05-17 17:18:58 +01:00

coins_tests.cpp

refactor: Remove defunct attributes.h includes

2022-05-21 13:54:33 -05:00

coinstatsindex_tests.cpp

Merge bitcoin/bitcoin#24513 : CChainState -> Chainstate

2022-09-13 15:42:18 +01:00

compilerbug_tests.cpp

…

compress_tests.cpp

…

crypto_tests.cpp

…

cuckoocache_tests.cpp

…

dbwrapper_tests.cpp

refactor: use C++11 default initializers

2022-05-17 17:18:58 +01:00

denialofservice_tests.cpp

net: drop cs_sendProcessing

2022-09-15 14:44:42 +10:00

descriptor_tests.cpp

test: remove unused norm_prv parameter

2022-08-21 18:26:11 -03:00

flatfile_tests.cpp

Use AutoFile where possible

2022-06-29 10:33:13 +02:00

fs_tests.cpp

…

getarg_tests.cpp

test: Remove boost::split from getarg_tests.cpp

2022-04-29 14:35:50 +02:00

hash_tests.cpp

…

headers_sync_chainwork_tests.cpp

Add unit test for HeadersSyncState

2022-08-29 08:10:35 -04:00

httpserver_tests.cpp

Add GetQueryParameter helper function

2022-03-10 12:01:54 +01:00

i2p_tests.cpp

Merge bitcoin/bitcoin#25614 : Severity-based logging, step 2

2022-09-01 15:57:56 -04:00

interfaces_tests.cpp

refactor: Add lock annotations to Active* methods

2022-08-16 17:26:40 +02:00

key_io_tests.cpp

refactor: Make const refs vars where applicable

2022-07-27 13:27:57 +02:00

key_tests.cpp

refactor: use Span in random.*

2022-03-23 17:36:33 -05:00

logging_tests.cpp

Create BCLog::Level::Trace log severity level

2022-08-20 11:55:17 +02:00

main.cpp

…

Makefile

…

mempool_tests.cpp

test: use NoLimits() in MempoolIndexingTest

2022-10-05 13:07:11 +01:00

merkle_tests.cpp

…

merkleblock_tests.cpp

…

miner_tests.cpp

test: Use dedicated mempool in TestBasicMining

2022-10-05 13:36:57 +02:00

miniscript_tests.cpp

Permit delaying duplicate key check in miniscript::Node construction

2022-09-17 10:47:05 +02:00

minisketch_tests.cpp

test: Prevent UB in minisketch_tests.cpp

2022-10-06 12:50:54 +01:00

multisig_tests.cpp

Pass datacarrier setting into IsStandard

2022-08-02 15:28:30 +02:00

net_peer_eviction_tests.cpp

doc: Convert remaining comments to clang-tidy format

2022-04-06 15:37:07 +02:00

net_tests.cpp

Merge bitcoin/bitcoin#26036 : net: add NetEventsInterface::g_msgproc_mutex

2022-09-20 14:18:23 +01:00

netbase_tests.cpp

Validate port value in SplitHostPort

2022-10-05 19:24:04 +02:00

orphanage_tests.cpp

[tests] Move TxOrphange tests to orphange_tests.cpp

2022-04-25 08:37:01 +01:00

pmt_tests.cpp

Remove not needed ArithToUint256 roundtrips in tests

2022-04-14 19:29:52 +02:00

policy_fee_tests.cpp

…

policyestimator_tests.cpp

test/policyestimator: Use ChainTestingSetup's CTxMemPool

2022-06-15 17:28:55 -04:00

pow_tests.cpp

Add function to validate difficulty changes

2022-08-23 11:34:10 -04:00

prevector_tests.cpp

test, bench: make prevector and checkqueue swap member functions noexcept

2022-04-28 20:34:43 +02:00

raii_event_tests.cpp

refactor: use <cstdlib> over stdlib.h

2022-09-23 10:48:47 +01:00

random_tests.cpp

refactor: Make FEELER_SLEEP_WINDOW type safe (std::chrono)

2022-07-13 15:21:12 +02:00

rbf_tests.cpp

[test] make tx6 child of tx5, not tx3, in rbf_tests

2022-08-11 12:48:09 +01:00

README.md

…

rest_tests.cpp

Handle query string when parsing data format

2022-03-10 12:01:53 +01:00

result_tests.cpp

refactor: Replace BResult with util::Result

2022-08-03 07:33:01 -04:00

reverselock_tests.cpp

…

rpc_tests.cpp

univalue: Remove unused and confusing set*() return value

2022-07-29 15:24:42 +02:00

sanity_tests.cpp

compat: remove glibcxx sanity checks

2022-05-28 09:43:02 +01:00

scheduler_tests.cpp

Switch scheduler to steady_clock

2022-05-10 10:54:54 +02:00

script_p2sh_tests.cpp

test: Add missing static to IsStandardTx helper

2022-08-03 11:19:53 +02:00

script_parse_tests.cpp

…

script_segwit_tests.cpp

…

script_standard_tests.cpp

scripted-diff: Use getInt<T> over get_int/get_int64

2022-05-18 19:15:03 +02:00

script_tests.cpp

refactor: Make const refs vars where applicable

2022-07-27 13:27:57 +02:00

scriptnum10.h

…

scriptnum_tests.cpp

…

serfloat_tests.cpp

…

serialize_tests.cpp

…

settings_tests.cpp

settings: Add update/getPersistent/isIgnored methods

2022-05-19 11:32:56 -04:00

sighash_tests.cpp

refactor: Make const refs vars where applicable

2022-07-27 13:27:57 +02:00

sigopcount_tests.cpp

…

skiplist_tests.cpp

Add functions to construct locators without CChain

2022-08-23 16:05:00 -04:00

sock_tests.cpp

refactor: move compat.h into compat/

2022-07-20 10:34:46 +01:00

streams_tests.cpp

…

sync_tests.cpp

…

system_tests.cpp

refactor: move run_command from util to common

2022-10-04 21:21:05 +00:00

timedata_tests.cpp

…

torcontrol_tests.cpp

…

transaction_tests.cpp

Merge bitcoin/bitcoin#25707 : refactor: Make const references to avoid unnecessarily copying objects and enable two clang-tidy checks

2022-08-19 17:11:06 +02:00

txindex_tests.cpp

indexes, refactor: Pass Chain interface instead of CChainState class to indexes

2022-07-18 13:39:55 -05:00

txpackage_tests.cpp

refactor: fixup named args in txpackage tests

2022-04-07 12:50:54 +01:00

txrequest_tests.cpp

…

txvalidation_tests.cpp

…

txvalidationcache_tests.cpp

tests: Reduce calls to InitS*Cache()

2022-08-03 12:02:31 -04:00

uint256_tests.cpp

…

util_tests.cpp

refactor: move Boost datetime usage to wallet

2022-10-01 11:41:53 +01:00

util_threadnames_tests.cpp

…

validation_block_tests.cpp

Require callers of AcceptBlockHeader() to perform anti-dos checks

2022-08-29 08:10:35 -04:00

validation_chainstate_tests.cpp

scripted-diff: rename CChainState -> Chainstate

2022-09-09 11:47:27 -04:00

validation_chainstatemanager_tests.cpp

Fix issues identified by codespell 2.2.1 and update ignored words

2022-09-15 13:03:40 +02:00

validation_flush_tests.cpp

scripted-diff: rename CChainState -> Chainstate

2022-09-09 11:47:27 -04:00

validation_tests.cpp

…

validationinterface_tests.cpp

…

versionbits_tests.cpp

Merge bitcoin/bitcoin#25200 : doc: Fix spelling errors identified by codespell in comments

2022-05-31 15:19:59 +02:00

README.md

Unit tests

The sources in this directory are unit test cases. Boost includes a unit testing framework, and since Bitcoin Core already uses Boost, it makes sense to simply use this framework rather than require developers to configure some other framework (we want as few impediments to creating unit tests as possible).

The build system is set up to compile an executable called test_bitcoin that runs all of the unit tests. The main source file for the test library is found in util/setup_common.cpp.

Compiling/running unit tests

Unit tests will be automatically compiled if dependencies were met in ./configure and tests weren't explicitly disabled.

After configuring, they can be run with make check.

To run the unit tests manually, launch src/test/test_bitcoin. To recompile after a test file was modified, run make and then run the test again. If you modify a non-test file, use make -C src/test to recompile only what's needed to run the unit tests.

To add more unit tests, add BOOST_AUTO_TEST_CASE functions to the existing .cpp files in the test/ directory or add new .cpp files that implement new BOOST_AUTO_TEST_SUITE sections.

To run the GUI unit tests manually, launch src/qt/test/test_bitcoin-qt.

To add more GUI unit tests, add them to the src/qt/test/ directory and the src/qt/test/test_main.cpp file.

Running individual tests

test_bitcoin accepts the command line arguments from the boost framework. For example, to run just the getarg_tests suite of tests:

test_bitcoin --log_level=all --run_test=getarg_tests

log_level controls the verbosity of the test framework, which logs when a test case is entered, for example. test_bitcoin also accepts the command line arguments accepted by bitcoind. Use -- to separate both types of arguments:

test_bitcoin --log_level=all --run_test=getarg_tests -- -printtoconsole=1

The -printtoconsole=1 after the two dashes redirects the debug log, which would normally go to a file in the test datadir (BasicTestingSetup::m_path_root), to the standard terminal output.

To run just the doubledash test case of the getarg_tests suite:

test_bitcoin --run_test=getarg_tests/doubledash

Run test_bitcoin --help for the full list.
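
The role of the `--` separator described above can be pictured with a short sketch. It only illustrates the idea of partitioning a command line; `SplitAtDoubleDash` is a hypothetical helper and is not how Boost or test_bitcoin actually implement argument handling.

```cpp
#include <cassert>
#include <string>
#include <utility>
#include <vector>

using Args = std::vector<std::string>;

// Split a command line at the first bare "--": everything before it is meant
// for the test framework, everything after it for the program under test.
// The separator itself is dropped.
std::pair<Args, Args> SplitAtDoubleDash(const Args& args)
{
    Args framework, program;
    bool seen_separator = false;
    for (const std::string& arg : args) {
        if (!seen_separator && arg == "--") {
            seen_separator = true;
            continue;
        }
        (seen_separator ? program : framework).push_back(arg);
    }
    // Every argument lands in exactly one group, except the separator.
    assert(framework.size() + program.size() + (seen_separator ? 1 : 0) == args.size());
    return std::make_pair(framework, program);
}
```

Applied to the example command line, `--log_level=all` and `--run_test=getarg_tests` fall into the first group and `-printtoconsole=1` into the second.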

Adding test cases

To add a new unit test file to our test suite you need to add the file to src/Makefile.test.include. The pattern is to create one test file for each class or source file for which you want to create unit tests. The file naming convention is <source_filename>_tests.cpp and such files should wrap their tests in a test suite called <source_filename>_tests. For an example of this pattern, see uint256_tests.cpp.

Logging and debugging in unit tests

make check will write to a log file foo_tests.cpp.log and display this file on failure. For running individual tests verbosely, refer to the section above.

To write to logs from unit tests you need to use specific message methods provided by Boost. The simplest is BOOST_TEST_MESSAGE.

For debugging you can launch the test_bitcoin executable with gdb or lldb and start debugging, just like you would with any other program:

gdb src/test/test_bitcoin

Segmentation faults

If you hit a segmentation fault during a test run, you can diagnose where the fault is happening by running gdb ./src/test/test_bitcoin and then using the bt command within gdb.

Another tool that can be used to resolve segmentation faults is valgrind.

If for whatever reason you want to produce a core dump file for this fault, you can do that as well. By default, the boost test runner will intercept system errors and not produce a core file. To bypass this, add --catch_system_errors=no to the test_bitcoin arguments and ensure that your ulimits are set properly (e.g. ulimit -c unlimited).

Running the tests and hitting a segmentation fault should now produce a file called core (on Linux platforms, the file name will likely depend on the contents of /proc/sys/kernel/core_pattern).

You can then explore the core dump using:

gdb src/test/test_bitcoin core

(gdb) bt  # produce a backtrace for where a segfault occurred