(from github.com/goldenivan)
Hi everyone,
I am trying to crawl a Drupal website. For this one, I need an authentication. I tried the method in the documentation here : https://fess.codelibs.org/12.1/admin/webconfig-guide.html
I used this configuration, but it is not working :
Web Crawling Configuration
ID = g55y9mEB4EJ67hIPqNja
URLs = http://example.com/login
Included URLs For Crawling = http://example.com/.*
Config Parameters = client.robotsTxtEnabled=false
User Agent = Mozilla/5.0 (compatible; Fess/12.1; +http://fess.codelibs.org/bot.html)
The number of Thread = 1
Interval time = 10000 ms
Boost = 1.0
Permissions = {role}guest
Crawler in the Scheduler
Target = all
Schedule = 0 0 * * *
Executor = groovy
Script = return container.getComponent(“crawlJob”).logLevel(“info”).sessionId(“g55y9mEB4EJ67hIPqNja”).webConfigIds([“g55y9mEB4EJ67hIPqNja”] as String[]).fileConfigIds([] as String[]).dataConfigIds([] as String[]).jobExecutor(executor).execute();
Web Authentication
Hostname = http://example.com
Scheme = Form
Username = user
Password = pwd
Parameters =
encoding=UTF-8
token_method=GET
token_url=http://example.com/login
token_pattern=name=”authenticity_token” +value=”([^”]+)”
token_name=authenticity_token
login_method=POST
login_url=http://example.com/login
login_parameters=name=${username}&pass=${password}
form_build_id=“MyToken”
form_id=“connect”
op=“Connection”
Fess fail to connect on the website. Have you any idea about what is not working ?
Have you any idea what is happening ?