The current hive_follows table is bigger than it should be.
Replace it with the tables below which are smaller and better optimized for the queries we use. In particular, this should speed up muting, since there are fewer mutes than follows.
This will require both the sync and API queries to be changed.
Functions that return created_at can look up the time using hafd.blocks.
CREATE TABLE follows ( follower INTEGER, following INTEGER, block_num INTEGER, PRIMARY KEY (follower, following));CREATE INDEX follows_follower_idx ON follows(follower);CREATE INDEX follows_following_idx ON follows(following);CREATE TABLE muted ( follower INTEGER, following INTEGER, block_num INTEGER, PRIMARY KEY (follower, following));CREATE INDEX muted_follower_idx ON muted(follower);CREATE INDEX muted_following_idx ON muted(following);CREATE TABLE blacklisted ( follower INTEGER, following INTEGER, block_num INTEGER, PRIMARY KEY (follower, following));CREATE INDEX blacklisted_follower_idx ON blacklisted(follower);CREATE INDEX blacklisted_following_idx ON blacklisted(following);CREATE TABLE follow_muted ( follower INTEGER, following INTEGER, block_num INTEGER, PRIMARY KEY (follower, following));CREATE INDEX follow_muted_follower_idx ON follow_muted(follower);CREATE INDEX follow_muted_following_idx ON follow_muted(following);CREATE TABLE follow_blacklisted ( follower INTEGER, following INTEGER, block_num INTEGER, PRIMARY KEY (follower, following));CREATE INDEX follow_blacklisted_follower_idx ON follow_blacklisted(follower);CREATE INDEX follow_blacklisted_following_idx ON follow_blacklisted(following);
Edited
Designs
Child items ...
Show closed items
Linked items 0
Link issues together to show that they're related.
Learn more.
The new table functions will return follows/following sorted by name, unlike the old functions that used position in the table to determine the ordering.
This issue is currently being implemented in !863 which is also re-writing code for update_notification_cache to incrementally update the notification cache instead of totally re-filling the entire table each block.
The times for processing blocks with the new implementation 1e627fa0 are surprisingly varied. Some of the time, a block only takes .046s to process, which is fairly reasonable, but on other blocks it is 10x slower (e.g. 0.462s). Maybe it is just due to a lot of activity in those blocks, but it probably needs investigation just to be sure code is maximally optimized. Below is some of the log:
INFO - hive.indexer.sync:184 - [SINGLE] *** SINGLE block processing***INFO - hive.indexer.sync:185 - [SINGLE] Current system time: 2025-03-23 16:11:10.096INFO - hive.indexer.sync:246 - [SINGLE] Attempting to process first block in range: <94391500:94391500>INFO - hive.indexer.blocks:207 - #############################################################################INFO - hive.indexer.blocks:224 - [PROCESS MULTI] Flushing data in 1 threadINFO - hive.indexer.blocks:227 - [PROCESS MULTI] Tables updating in live synchronizationINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_posts_rshares(94391500, 94391500) executed in: 0.0055 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_hive_posts_children_count(94391500, 94391500) executed in: 0.0025 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_hive_posts_root_id(94391500,94391500) executed in: 0.0006 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_feed_cache(94391500, 94391500) executed in: 0.0004 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_hive_posts_mentions(94391500, 94391500) executed in: 0.0004 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_notification_cache(94391500, 94391500, False) executed in: 0.0484 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_follow_count(94391500, 94391500) executed in: 0.0006 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_last_completed_block(94391500) executed in: 0.0021 sINFO - hive.utils.timer:18 - on_live_blocks_processed executed in 0.0611 sINFO - hive.indexer.blocks:235 - [PROCESS MULTI] 1 blocks in 0.1065sINFO - hive.indexer.sync:123 - Last imported block is: 94391500INFO - hive.indexer.sync:123 - Last imported block is: 94391500INFO - hive.indexer.sync:123 - Last imported block is: 94391500INFO - hive.indexer.sync:241 - Next block range from hive.app_next_iteration is: <94391501:94391501>INFO - hive.indexer.sync:143 - target_head_block: 94391501INFO - hive.indexer.sync:144 - test_max_block: NoneINFO - hive.indexer.sync:184 - [SINGLE] *** SINGLE block processing***INFO - hive.indexer.sync:185 - [SINGLE] Current system time: 2025-03-23 16:11:13.220INFO - hive.indexer.sync:246 - [SINGLE] Attempting to process first block in range: <94391501:94391501>INFO - hive.indexer.blocks:207 - #############################################################################INFO - hive.indexer.blocks:224 - [PROCESS MULTI] Flushing data in 1 threadINFO - hive.indexer.blocks:227 - [PROCESS MULTI] Tables updating in live synchronizationINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_posts_rshares(94391501, 94391501) executed in: 0.0051 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_hive_posts_children_count(94391501, 94391501) executed in: 0.0024 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_hive_posts_root_id(94391501,94391501) executed in: 0.0003 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_feed_cache(94391501, 94391501) executed in: 0.0002 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_hive_posts_mentions(94391501, 94391501) executed in: 0.0003 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_notification_cache(94391501, 94391501, False) executed in: 0.0447 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_follow_count(94391501, 94391501) executed in: 0.0005 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_last_completed_block(94391501) executed in: 0.0021 sINFO - hive.utils.timer:18 - on_live_blocks_processed executed in 0.0559 sINFO - hive.indexer.blocks:235 - [PROCESS MULTI] 1 blocks in 0.0817sINFO - hive.indexer.sync:123 - Last imported block is: 94391501INFO - hive.indexer.sync:123 - Last imported block is: 94391501INFO - hive.indexer.sync:123 - Last imported block is: 94391501INFO - hive.indexer.sync:241 - Next block range from hive.app_next_iteration is: <94391502:94391502>INFO - hive.indexer.sync:143 - target_head_block: 94391502INFO - hive.indexer.sync:144 - test_max_block: NoneINFO - hive.indexer.sync:184 - [SINGLE] *** SINGLE block processing***INFO - hive.indexer.sync:185 - [SINGLE] Current system time: 2025-03-23 16:11:16.318INFO - hive.indexer.sync:246 - [SINGLE] Attempting to process first block in range: <94391502:94391502>INFO - hive.indexer.blocks:207 - #############################################################################INFO - hive.indexer.blocks:224 - [PROCESS MULTI] Flushing data in 1 threadINFO - hive.indexer.blocks:227 - [PROCESS MULTI] Tables updating in live synchronizationINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_posts_rshares(94391502, 94391502) executed in: 0.0034 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_hive_posts_children_count(94391502, 94391502) executed in: 0.0024 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_hive_posts_root_id(94391502,94391502) executed in: 0.0003 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_feed_cache(94391502, 94391502) executed in: 0.0003 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_hive_posts_mentions(94391502, 94391502) executed in: 0.0017 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_notification_cache(94391502, 94391502, False) executed in: 0.4623 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_follow_count(94391502, 94391502) executed in: 0.0006 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_last_completed_block(94391502) executed in: 0.0014 sINFO - hive.utils.timer:18 - on_live_blocks_processed executed in 0.4728 sINFO - hive.indexer.blocks:235 - [PROCESS MULTI] 1 blocks in 0.4977sINFO - hive.indexer.sync:123 - Last imported block is: 94391502INFO - hive.indexer.sync:123 - Last imported block is: 94391502INFO - hive.indexer.sync:241 - Next block range from hive.app_next_iteration is: <94391503:94391503>INFO - hive.indexer.sync:143 - target_head_block: 94391503INFO - hive.indexer.sync:144 - test_max_block: NoneINFO - hive.indexer.sync:184 - [SINGLE] *** SINGLE block processing***INFO - hive.indexer.sync:185 - [SINGLE] Current system time: 2025-03-23 16:11:18.326INFO - hive.indexer.sync:246 - [SINGLE] Attempting to process first block in range: <94391503:94391503>INFO - hive.indexer.blocks:207 - #############################################################################INFO - hive.indexer.blocks:224 - [PROCESS MULTI] Flushing data in 1 threadINFO - hive.indexer.blocks:227 - [PROCESS MULTI] Tables updating in live synchronizationINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_posts_rshares(94391503, 94391503) executed in: 0.0034 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_hive_posts_children_count(94391503, 94391503) executed in: 0.0024 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_hive_posts_root_id(94391503,94391503) executed in: 0.0003 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_feed_cache(94391503, 94391503) executed in: 0.0022 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_hive_posts_mentions(94391503, 94391503) executed in: 0.0004 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_notification_cache(94391503, 94391503, False) executed in: 0.0453 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_follow_count(94391503, 94391503) executed in: 0.0005 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_last_completed_block(94391503) executed in: 0.0015 sINFO - hive.utils.timer:18 - on_live_blocks_processed executed in 0.0563 sINFO - hive.indexer.blocks:235 - [PROCESS MULTI] 1 blocks in 0.0869sINFO - hive.indexer.sync:123 - Last imported block is: 94391503INFO - hive.indexer.sync:123 - Last imported block is: 94391503INFO - hive.indexer.sync:123 - Last imported block is: 94391503INFO - hive.indexer.sync:241 - Next block range from hive.app_next_iteration is: <94391504:94391504>INFO - hive.indexer.sync:143 - target_head_block: 94391504INFO - hive.indexer.sync:144 - test_max_block: NoneINFO - hive.indexer.sync:184 - [SINGLE] *** SINGLE block processing***INFO - hive.indexer.sync:185 - [SINGLE] Current system time: 2025-03-23 16:11:21.432INFO - hive.indexer.sync:246 - [SINGLE] Attempting to process first block in range: <94391504:94391504>INFO - hive.indexer.blocks:207 - #############################################################################INFO - hive.indexer.blocks:224 - [PROCESS MULTI] Flushing data in 1 threadINFO - hive.indexer.blocks:227 - [PROCESS MULTI] Tables updating in live synchronizationINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_posts_rshares(94391504, 94391504) executed in: 0.0054 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_hive_posts_children_count(94391504, 94391504) executed in: 0.0018 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_hive_posts_root_id(94391504,94391504) executed in: 0.0003 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_feed_cache(94391504, 94391504) executed in: 0.0002 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_hive_posts_mentions(94391504, 94391504) executed in: 0.0024 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_notification_cache(94391504, 94391504, False) executed in: 0.4632 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_follow_count(94391504, 94391504) executed in: 0.0005 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_last_completed_block(94391504) executed in: 0.0038 sINFO - hive.utils.timer:18 - on_live_blocks_processed executed in 0.4778 sINFO - hive.indexer.blocks:235 - [PROCESS MULTI] 1 blocks in 0.5069sINFO - hive.indexer.sync:123 - Last imported block is: 94391504INFO - hive.indexer.sync:123 - Last imported block is: 94391504INFO - hive.indexer.sync:123 - Last imported block is: 94391504INFO - hive.indexer.sync:241 - Next block range from hive.app_next_iteration is: <94391505:94391505>INFO - hive.indexer.sync:143 - target_head_block: 94391505INFO - hive.indexer.sync:144 - test_max_block: NoneINFO - hive.indexer.sync:184 - [SINGLE] *** SINGLE block processing***INFO - hive.indexer.sync:185 - [SINGLE] Current system time: 2025-03-23 16:11:24.959INFO - hive.indexer.sync:246 - [SINGLE] Attempting to process first block in range: <94391505:94391505>WARNING - hive.indexer.custom_op:80 - setLastRead::date: `2025-03-23T16:13:21' exceeds head block time. Correcting to head block time: `2025-03-23T16:11:21'INFO - hive.indexer.blocks:207 - #############################################################################INFO - hive.indexer.blocks:224 - [PROCESS MULTI] Flushing data in 1 threadINFO - hive.indexer.blocks:227 - [PROCESS MULTI] Tables updating in live synchronizationINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_posts_rshares(94391505, 94391505) executed in: 0.0037 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_hive_posts_children_count(94391505, 94391505) executed in: 0.0025 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_hive_posts_root_id(94391505,94391505) executed in: 0.0003 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_feed_cache(94391505, 94391505) executed in: 0.0004 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_hive_posts_mentions(94391505, 94391505) executed in: 0.0019 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_notification_cache(94391505, 94391505, False) executed in: 0.4699 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_follow_count(94391505, 94391505) executed in: 0.0006 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_last_completed_block(94391505) executed in: 0.0031 sINFO - hive.utils.timer:18 - on_live_blocks_processed executed in 0.4827 sINFO - hive.indexer.blocks:235 - [PROCESS MULTI] 1 blocks in 0.5112sINFO - hive.indexer.sync:123 - Last imported block is: 94391505INFO - hive.indexer.sync:123 - Last imported block is: 94391505INFO - hive.indexer.sync:123 - Last imported block is: 94391505INFO - hive.indexer.sync:241 - Next block range from hive.app_next_iteration is: <94391506:94391506>INFO - hive.indexer.sync:143 - target_head_block: 94391506INFO - hive.indexer.sync:144 - test_max_block: NoneINFO - hive.indexer.sync:184 - [SINGLE] *** SINGLE block processing***INFO - hive.indexer.sync:185 - [SINGLE] Current system time: 2025-03-23 16:11:28.486INFO - hive.indexer.sync:246 - [SINGLE] Attempting to process first block in range: <94391506:94391506>INFO - hive.indexer.blocks:207 - #############################################################################INFO - hive.indexer.blocks:224 - [PROCESS MULTI] Flushing data in 1 threadINFO - hive.indexer.blocks:227 - [PROCESS MULTI] Tables updating in live synchronizationINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_posts_rshares(94391506, 94391506) executed in: 0.0041 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_hive_posts_children_count(94391506, 94391506) executed in: 0.0017 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_hive_posts_root_id(94391506,94391506) executed in: 0.0003 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_feed_cache(94391506, 94391506) executed in: 0.0002 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_hive_posts_mentions(94391506, 94391506) executed in: 0.0003 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_notification_cache(94391506, 94391506, False) executed in: 0.0456 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_follow_count(94391506, 94391506) executed in: 0.0005 sINFO - hive.indexer.blocks:435 - SELECT hivemind_app.update_last_completed_block(94391506) executed in: 0.0015 sINFO - hive.utils.timer:18 - on_live_blocks_processed executed in 0.0545 sINFO - hive.indexer.blocks:235 - [PROCESS MULTI] 1 blocks in 0.0740sINFO - hive.indexer.sync:123 - Last imported block is: 94391506INFO - hive.indexer.sync:123 - Last imported block is: 94391506INFO - hive.indexer.sync:241 - Next block range from hive.app_next_iteration is: <94391507:94391507>INFO - hive.indexer.sync:143 - target_head_block: 94391507INFO - hive.indexer.sync:144 - test_max_block: None
Fortunately, log above probably wasn't the most representative example, and in most cases, it seems like only 1 in 8 takes a long time. Overall, we are definitely seeing much less load on the node during live sync with this change of update_notification_cache to an incremental implementation, as expected. Still probably worth determining why some blocks are taking longer just in case it can be improved.
Note that hivemind_app.update_notification_cache function is scheduled for deletion in this MR.
The new code is in the indexer, which flush times are apparently not logged in live sync.