Extract only file names from an Amazon S3 bucket

Question

I have a requirement to extract only file names from an Amazon S3 bucket without those extra 3 zeros after .csv , I'm doing that like this

# remove files so every time you have new names
rm ListOfFiles.txt

# get file names
aws s3 ls <bucket-address-directory-path> | awk '{print $4}' | sed 's/.csv000*/.csv/g' >> ListOfFiles.txt

I'm getting all those file names but there is a blank line at the top as directory there is a Folder. I don't need that folder, neither the blank line.

What in S3

Archive
ABC.csv000
BCD.csv000
DEF.csv000

What I'm getting

<a blank line here>
ABC.csv
BCD.csv
DEF.csv

What I need

ABC.csv
BCD.csv
DEF.csv

Answer 1

Combine awk and sed into one command, something like

aws s3 ls <bucket-address-directory-path> | sed -nr 's/.* ([^ ]*.csv)000.*/\1/p'

or

aws s3 ls <bucket-address-directory-path> | awk 'NF>3 { sub(/000$/,"", $4); print $4}'

Answer 2

change "{print $4}" to "{if(NR>1){print $4}}"

Extract only file names from an Amazon S3 bucket

Question

2 answers

solution1
3 ACCPTED 2021-08-22 10:51:03

solution2
0 2021-08-21 13:38:50

Extract only file names from an Amazon S3 bucket

Question

2 answers

solution1 3 ACCPTED 2021-08-22 10:51:03

solution2 0 2021-08-21 13:38:50

solution1
3 ACCPTED 2021-08-22 10:51:03

solution2
0 2021-08-21 13:38:50