Easy way to python in aws - use awswrangler
- bdata3
- Aug 3, 2021
- 1 min read
If you write python code in AWS environment you should consider use awswrangler.
I wrote a simple example - for finding a dataframe containing Geva in the name columns:
It first gets the list of all candidate files (you can use wild cards ...). than iterate on the results and find the file based on the condition
Enjoy!
If you wonder what awswrangler so:
As aws states: awswrangler is An AWS Professional Service open source python initiative that extends the power of Pandas library to AWS connecting DataFrames and AWS data-related services.
Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer, and S3 (Parquet, CSV, JSON, and EXCEL).
Built on top of other open-source projects like Pandas, Apache Arrow and Boto3, it offers abstracted functions to execute usual ETL tasks like load/unload data from Data Lakes, Data Warehouses, and Databases.
you can find all documentation and quick start here:

Comments