Input Data:
bucket name: s3://weborama-adloox
path: /seed_urls/<yyyymm>/<dd>/<hh>: Par exemple: /seed_urls/202106/08/02/
Frequency: every hour.
File name: input_yyyymm-dd-hh.csv.gz
File format: compressed CSV
File content: 1 url per row.
Output Data:
bucket name: s3://weborama-adloox
path: /url_profiles/contextual/<yyyymm>/<dd>/<hh>: Par exemple: /contextual/202106/08/02/
File name: <yyyymmdd-url-profiles-<contextual_owner>-<target_id>.json.gz
Add Comment