Spiculecharms Sparkler

  • By Spicule Charms
Channel Revision Published Runs on
latest/stable 10 19 Mar 2021
Ubuntu 16.04 Ubuntu 14.04
juju deploy spiculecharms-sparkler
Show information

Platform:

Ubuntu
16.04 14.04

Learn about actions >

  • addseedurls

    Add seed urls to sparkler

    Params
    • urls string

      Comma separated list of urls

  • crawl

    Crawl the urls

    Params
    • crawldb-uri string

      Crawldb uri

    • iterations string

      Number of iterations to run

    • jobid string

      Job id. When not sure, get the job id from injector command

    • output string

      Output path

    • sparkmaster string

      Spark Master URI. Not required if relation to spark is set.

    • topgroups string

      Max Groups to be selected for fetch

    • topn string

      Top urls per domain to be selected for a round.

  • inject

    Inject seed urls

    Params
    • crawldb-uri string

      Crawldb uri

    • jobid string

      Id of an existing Job to which the urls are to be injected. No argument will create a new job.

  • removeallseedurls

    Remove all seed urls from Sparkler

  • removeseedurls

    Remove seed urls

    Params
    • urls string

      Comma separated list of urls to remove