s3
It is common to store data in s3. RIME supports loading data directly from s3 so you do not have to copy the file locally.
Follow all of the instructions in the generic Data Configuration guide, except the path to the data should no longer be a local path but instead an s3 storage path in the format of s3://{bucket_name}/{object_key}
. Note that bucket_name
and object_key
should be replaced so they reference the desired csv or parquet file.
Example:
{
"data_info": {"ref_path": "s3://my-bucket/my-file.csv"}
}
In order to use this functionality, RIME assumes valid AWS credentials and region specification will be found in the AWS configuration file on your local machine (~/.aws/credentials
). Please make sure the credentials there are correct.