Importing Data From a URL

Behnam Samadi: Hey team, I’m fairly new to Label Studio and have a question I couldn’t find answered in the docs. Can anybody please explain what exactly happens when we import data from a URL?
To be more specific: after importing new data by URL, is all the data downloaded and stored in local storage, or does Label Studio use local storage only as a data cache?
All the data we want to annotate is accessible through URLs provided by another service, and our main challenge now is to work out how much storage we need to run the Label Studio service.

Chaskin Saroff: When you say “from a URL”, do you mean cloud storage (e.g. GCS or S3)?

Behnam Samadi: Assume we are getting these URLs from a third-party source. So no, they’re not from cloud storage.

Chris Hoge (HumanSignal): The docs section linked below includes information about importing data from a URL. Unless it’s something like a CSV file, if you load data from a URL, the data is not saved in Label Studio. If you want an annotated task export to include the data that you annotated, you must either import the data into the Label Studio database without using URL references, or combine the data with the annotations after exporting. https://labelstud.io/guide/tasks.html#How-to-import-your-data
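
To make the two options above more concrete, here is a minimal Python sketch, not an official Label Studio utility. It assumes tasks are JSON objects with a "data" dictionary, that the data key is "image" (the real key depends on your labeling configuration), and that the export is a JSON list of tasks that still carry the original URL. The helper names make_url_tasks and attach_source_data, the "source_b64" field, and the example.com URL are all hypothetical.

```python
import base64
from typing import Callable, Dict, List
from urllib.request import urlopen


def make_url_tasks(urls: List[str], data_key: str = "image") -> List[Dict]:
    """Build import tasks that only reference each URL (no file contents)."""
    # data_key is an assumption; it must match the variable in your labeling config.
    return [{"data": {data_key: url}} for url in urls]


def fetch_bytes(url: str) -> bytes:
    """Download the raw bytes behind a URL (default fetcher)."""
    with urlopen(url) as response:
        return response.read()


def attach_source_data(
    exported_tasks: List[Dict],
    data_key: str = "image",
    fetch: Callable[[str], bytes] = fetch_bytes,
) -> List[Dict]:
    """Combine exported annotations with the data they refer to.

    Each exported task is assumed to keep the original URL under
    task["data"][data_key]. The fetched bytes are stored base64-encoded
    in a hypothetical "source_b64" field next to the annotations.
    """
    combined = []
    for task in exported_tasks:
        raw = fetch(task["data"][data_key])
        encoded = base64.b64encode(raw).decode("ascii")
        combined.append({**task, "source_b64": encoded})
    return combined


if __name__ == "__main__":
    # Hypothetical URL; replace with the URLs served by your own service.
    tasks = make_url_tasks(["https://example.com/images/0001.jpg"])
    print(tasks)
```

Because the tasks built by make_url_tasks contain only URL strings, the storage they need is small compared with the files themselves; attach_source_data is where the actual data would be pulled in, after export, if you need it alongside the annotations.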

Note: This post was generated by the Label Studio Archive Bot from a conversation in the Label Studio Slack, a gathering place for the Label Studio community. Someone in the community thought this was worth sharing!