Uploading Large Files to Amazon S3 with AWS CLI

Amazon S3 is a widely used public cloud storage service. S3 allows a single object/file to be up to 5 TB, which is enough for most applications. The AWS Management Console provides a Web-based interface for users to upload and manage files in S3 buckets. However, uploading a large file of hundreds of GB is not easy using the Web interface; from my experience, it fails frequently. There are various third-party commercial tools that claim to help people upload large files to Amazon S3, and Amazon also provides a Multipart Upload API, which most of these tools are based on.
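For background, the Multipart Upload API works in three steps: initiate an upload, send the parts, and complete it. Below is a rough sketch. The `split` demo only illustrates the chunking locally; the low-level `aws s3api` calls are shown commented out as placeholders (the bucket name `my-bucket` and the upload ID are made up), since they need valid credentials and a real bucket:

```shell
# Chunking, illustrated locally: split a 10 MB file into 4 MB parts
dd if=/dev/zero of=sample.data bs=1M count=10 2>/dev/null
split -b 4M sample.data sample.part.
ls -1 sample.part.* | wc -l    # 3 parts: 4 MB + 4 MB + 2 MB

# The corresponding S3 calls (placeholders; require valid credentials):
# aws s3api create-multipart-upload --bucket my-bucket --key sample.data
# aws s3api upload-part --bucket my-bucket --key sample.data \
#     --part-number 1 --body sample.part.aa --upload-id "UPLOAD_ID"
# aws s3api complete-multipart-upload --bucket my-bucket --key sample.data \
#     --upload-id "UPLOAD_ID" --multipart-upload file://parts.json
```

The good news is that you do not need to drive this API by hand: the aws s3 cp command covered in this post does it for you automatically.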


While these tools are helpful, they are not free, and AWS already provides users a pretty good tool for uploading large files to S3: the open-source aws CLI from Amazon. In my tests, the aws s3 command-line tool achieved more than 7 MB/s upload speed on a shared 100 Mbps network, which should be good enough for many situations and network environments. In this post, I will give a tutorial on uploading large files to Amazon S3 with the aws command-line tool.

Install aws CLI tool

Assuming you already have a Python environment set up on your computer, you can install the aws tools using pip or using the bundled installer:

$ curl "https://s3.amazonaws.com/aws-cli/awscli-bundle.zip" -o "awscli-bundle.zip"
$ unzip awscli-bundle.zip
$ sudo ./awscli-bundle/install -i /usr/local/aws -b /usr/local/bin/aws

Try running aws after installation. If you see output like the following (the "too few arguments" error is expected when no command is given), you have installed it successfully.

$ aws
usage: aws [options] <command> <subcommand> [<subcommand> ...] [parameters]
To see help text, you can run:

  aws help
  aws <command> help
  aws <command> <subcommand> help
aws: error: too few arguments

Configure aws tool access

The quickest way to configure the AWS CLI is to run the aws configure command:

$ aws configure
AWS Access Key ID: foo
AWS Secret Access Key: bar
Default region name [us-west-2]: us-west-2
Default output format [None]: json

Here, your AWS Access Key ID and AWS Secret Access Key can be found under Your Security Credentials in the AWS Console.
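Alternatively, if you prefer not to store credentials in a config file, for example in a one-off script, the CLI also reads them from environment variables (the key values here are placeholders, as above):

```shell
export AWS_ACCESS_KEY_ID=foo
export AWS_SECRET_ACCESS_KEY=bar
export AWS_DEFAULT_REGION=us-west-2
```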

Uploading large files

Lastly, the fun part comes. Here, assume we are uploading the large file ./150GB.data to s3://systut-data-test/store_dir/ (that is, directory store_dir under bucket systut-data-test), and that the bucket and directory have already been created on S3. The command is:

$ aws s3 cp ./150GB.data s3://systut-data-test/store_dir/

After it starts to upload the file, it prints a progress message like

Completed 1 part(s) with ... file(s) remaining

at the beginning, and one like the following as it nears the end.

Completed 9896 of 9896 part(s) with 1 file(s) remaining
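As an aside, the part count hints at how the tool picks its chunk size: S3 allows at most 10,000 parts per multipart upload, so for a 150 GB file the CLI has to use chunks of roughly 15 MB rather than a small fixed size. A quick back-of-the-envelope check:

```shell
# 150 GB expressed in MiB, divided by the 9896 parts reported above
echo $(( 150 * 1024 / 9896 ))    # prints 15, i.e. about 15 MiB per part
```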

After it successfully uploads the file, it prints a message like

upload: ./150GB.data to s3://systut-data-test/store_dir/150GB.data
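Under the hood, aws s3 cp performed a multipart upload with parallel part transfers for this file. If you want to experiment with the transfer behavior, recent versions of the CLI let you tune it through the s3 section of its configuration; the values below are just examples, not recommendations:

```shell
# Upload up to 10 parts concurrently, 16 MB each (example values)
aws configure set default.s3.max_concurrent_requests 10
aws configure set default.s3.multipart_chunksize 16MB
```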

aws has more commands for operating on files in S3. I hope this tutorial helps you get started with it; check the manual for more details.

Eric Zhiqiang Ma

Eric is interested in building high-performance and scalable distributed systems and related technologies. The views or opinions expressed here are solely Eric's own and do not necessarily represent those of any third parties.

6 comments:

  1. To upload a directory recursively, you may use `aws s3 sync`. For example, to upload current directory to my-bucket bucket under dir my-dir:

    $ aws s3 sync . s3://my-bucket/my-dir/

  2. What happens when a large file upload fails?? This is not covered.
    I’ve been getting segfaults using the straight cp command, and re-running it will start again from the beginning. On large files this can mean days wasted.
