databrick add data that resides in AWS s3


To connect to your data source, you have placed in an s3 bucket, goto DataBricks -> Catalog ->  


Then



Then click on "+" to start the integration process




Then select AWS Quickstart. This will help you to create the necessary role and permission in your AWS stack.




Then click on "Next".  Provide your s3 bucket name and then you can click on Generate PAT token. Make sure you copy it and then click on "Launch Quickstart" - and you need to paste this information into the AWS cloud formation. 





The rest of the information would have been populated in AWS console and all you need to do is paste in your PAT token. Click on next. Once it is completed - about 5-10 minutes, refresh your databrick workspace and you should be able to see all your files there. 












Comments

Popular posts from this blog

The specified initialization vector (IV) does not match the block size for this algorithm