Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Server Side Collect

Input Data:

  • bucket name: s3://weborama-<publisher_name>.

  • path: /seed_urls/<yyyymm>/<dd>/<hh>: Par exemple: /seed_urls/202106/08/02/

  • Frequency: every hour.

  • File name: input_yyyymm-dd-hh.csv.gz

  • File format: compressed CSV

  • File content: 1 url per row.

Note

Protocol needs to be added in the source file, otherwise urls will be dropped

Client Side Collect

We deployed the Contextual Collect tag. In order to be used by a Publisher to collect urls, you have to first ask goldenfish-support@weborama.com to create a dedicated contextual client and provide you with the client ID. 

...