(from github.com/hottwerk)
Is it possible to remove already crawled urls from a index based on a regex?
lets say: delete all .*id_categorie=2.*
Or maybe filter them when search display?
(from github.com/hottwerk)
Is it possible to remove already crawled urls from a index based on a regex?
lets say: delete all .*id_categorie=2.*
Or maybe filter them when search display?
(from github.com/marevol)
You can remove it on Admin Search page.
(from github.com/hottwerk)
Yes, but is it possible to search url’s based on regex? If so could you give a small example?
(from github.com/marevol)
url:"http://fess.codelibs.org/*"
(from github.com/hottwerk)
Thank you very much!
url:“http://fess.codelibs.org/*” works!
url:“http://fess.codelibs.org/1*/install/*” doesn’t work. Can this be done?
Another question, how to escape characters in the search string? (I searching for a * in the url )
(from marevol (Shinsuke Sugaya) · GitHub)
url:“http://fess.codelibs.org/1*/install/*” doesn’t work.
It will be supported in a next release.
how to escape characters in the search string?
\*
(from marevol (Shinsuke Sugaya) · GitHub)
url:“http://fess.codelibs.org/1*/install/*” doesn’t work.
It will be supported in a next release.
Oops, it’s not correct…
url:“…” is a phrase query with supporting a prefix query.
To use prefix/wildcard query, the query is:
url:http\:\/\/fess.codelibs.org\/*\/12.0\/*
(from github.com/hottwerk)
That seems to work…better. Not totally like I would expect but now I can figure it out now.
Thank you!
© 2020. All Rights Reserved - CodeLibs, Inc.