Fetch a Zip Archive, Scan, and Extract Its Entries
The api command can fetch a remote Zip archive and scan the metadata of its file entries.
Specific file entries can be extracted and automatically serialized into datasets.
Let's take a look at an example.
Example
Full Query
api get http://featureblend.com/static/zip_example.zip
|| api get http://featureblend.com/static/zip_example.zip --unzip.path $path$
Stage 1: Open and scan Zip file entries
api get http://featureblend.com/static/zip_example.zip
The first stage scans the Zip file and returns a list of metadata for its internal file entries. The metadata describes each file found in the archive, including its name, path, size, and other details. It does not include the decompressed content.
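To make the idea of "metadata without content" concrete, here is a minimal sketch using Python's standard zipfile module. It is not how the api command is implemented. The archive is built in memory as a hypothetical stand-in for zip_example.zip, and the column names (path, name, size, compressed_size) are assumptions chosen for illustration.

```python
import io
import zipfile


def build_sample_zip() -> bytes:
    """Build a small in-memory Zip archive (hypothetical stand-in for zip_example.zip)."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as archive:
        archive.writestr("data/users.csv", "id,name\n1,Ada\n2,Linus\n")
        archive.writestr("data/config.json", '{"enabled": true}')
    return buffer.getvalue()


def scan_entries(zip_bytes: bytes) -> list:
    """Return one metadata row per file entry without decompressing any content."""
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        return [
            {
                "path": info.filename,                       # full path inside the archive
                "name": info.filename.rsplit("/", 1)[-1],    # last path segment
                "size": info.file_size,                      # uncompressed size in bytes
                "compressed_size": info.compress_size,       # stored size in bytes
            }
            for info in archive.infolist()
            if not info.is_dir()
        ]


rows = scan_entries(build_sample_zip())
for row in rows:
    print(row)
```

Each row plays the role of one entry in the Stage 1 result: it identifies a file and its size, but reading the file's content is left to a later step.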
Stage 2: Extract and convert files into a merged dataset
...
|| api get http://featureblend.com/static/zip_example.zip --unzip.path $path$
With our list of Zip file entries, we use the api command with a $path$ token to pluck the path value from each row. The $path$ token is used to construct the --unzip.path value, which automatically extracts the specified file and serializes its contents into a dataset.
NOTE: The automagic serialization of a file's contents into a dataset is determined by its file extension.
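The extract-and-merge step can be sketched the same way in Python. This is an illustration under stated assumptions, not the api command's actual behavior: the mapping from file extension to parser (.csv and .json, with a plain-text fallback) is hypothetical, as are the sample file names and the helper names extract_rows and extract_all.

```python
import csv
import io
import json
import zipfile
from pathlib import PurePosixPath


def make_zip() -> bytes:
    """Build a small in-memory archive (hypothetical sample data)."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("users.csv", "id,name\n1,Ada\n2,Linus\n")
        archive.writestr("extra.json", '[{"id": "3", "name": "Grace"}]')
    return buffer.getvalue()


def extract_rows(archive: zipfile.ZipFile, path: str) -> list:
    """Decompress one entry and pick a parser from its file extension (assumed mapping)."""
    text = archive.read(path).decode("utf-8")
    suffix = PurePosixPath(path).suffix.lower()
    if suffix == ".csv":
        return list(csv.DictReader(io.StringIO(text)))
    if suffix == ".json":
        parsed = json.loads(text)
        return parsed if isinstance(parsed, list) else [parsed]
    # Fallback for unknown extensions: one row per line of text.
    return [{"line": line} for line in text.splitlines()]


def extract_all(zip_bytes: bytes) -> list:
    """Extract every file entry by path and merge the parsed rows into one dataset."""
    merged = []
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue
            merged.extend(extract_rows(archive, info.filename))
    return merged


dataset = extract_all(make_zip())
for row in dataset:
    print(row)
```

The loop over entry paths mirrors the role of the $path$ token: each row from the scan supplies one path, and the rows parsed from each file are appended to a single merged result.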