如何将 Find 的 output 排序到 psql 复制命令以按顺序加载数据？

Question

I wish to load data to a PostgreSQL DB from a bunch of files that are in multiple folders.我希望从多个文件夹中的一堆文件中将数据加载到 PostgreSQL 数据库中。 I have to load them in order (ie files in folders 2020 must be loaded before folders 2021 and so on).我必须按顺序加载它们（即文件夹 2020 中的文件必须在文件夹 2021 之前加载，依此类推）。 This is what I have currently:这是我目前拥有的：

find  ~/data/inserts/ -type f -exec psql -h db1.cluster-xxxxx.us-east-1.rds.amazonaws.com -p 5432 dbname -U admin1 -c "\COPY public.db1(col1,col2) FROM '{}' DELIMITER ',' CSV HEADER;" \;

This loads the data in the files, but the files aren't sorted.这会加载文件中的数据，但不会对文件进行排序。 By Googling, I know you can pipe into sort like so:通过谷歌搜索，我知道您可以将 pipe sort为：

find ~/data/inserts/ -type f -print | sort -z | xargs -r0 echo

but I am not sure how to apply it to my case.但我不确定如何将其应用于我的案例。 I am not sure how to use xargs -r0 even after reading the docs.即使在阅读文档后，我也不确定如何使用xargs -r0 。

Answer 1

You need -print0 instead of -print as find argument:您需要-print0而不是-print作为find参数：

#!/usr/bin/env bash

# Pipe the sorted null delimited output of find to while loop
find ./ -type f -print0 | sort -z |
while IFS= read -r -d '' input_file || [ -n "$input_file" ]; do
  # Now execute the pgsql command to copy from STDIN rather than named file
  psql \
    -h db1.cluster-xxxxx.us-east-1.rds.amazonaws.com -p 5432 -U admin1 dbname \
    -c "COPY public.db1(col1,col2) FROM STDIN DELIMITER ',' CSV HEADER;" \
    <"$input_file" # This provide the input file as STDIN
done

如何将 Find 的 output 排序到 psql 复制命令以按顺序加载数据？

问题描述

1 个解决方案

解决方案1
2 已采纳 2021-04-07 12:39:15

如何将 Find 的 output 排序到 psql 复制命令以按顺序加载数据？

问题描述

1 个解决方案

解决方案1 2 已采纳 2021-04-07 12:39:15

解决方案1
2 已采纳 2021-04-07 12:39:15