cewl----字典生成

简介

kali系统自带的一款工具,会根据网页的内容,自动生成字典。

详解

cewl -h查看帮助文档。

代码语言:javascript
复制
┌──(root㉿kali)-[/usr/share/wordlists]
└─# cewl -h
CeWL 5.5.2 (Grouping) Robin Wood (robin@digi.ninja) (https://digi.ninja/)
Usage: cewl [OPTIONS] ... <url>
OPTIONS:
    -h, --help: Show help.
    -k, --keep: Keep the downloaded file.
    -d &lt;x&gt;,--depth &lt;x&gt;: Depth to spider to, default 2.
    -m, --min_word_length: Minimum word length, default 3.
    -o, --offsite: Let the spider visit other sites.
    --exclude: A file containing a list of paths to exclude
    --allowed: A regex pattern that path must match to be followed
    -w, --write: Write the output to the file.
    -u, --ua &lt;agent&gt;: User agent to send.
    -n, --no-words: Don&#39;t output the wordlist.
    -g &lt;x&gt;, --groups &lt;x&gt;: Return groups of words as well
    --lowercase: Lowercase all parsed words
    --with-numbers: Accept words with numbers in as well as just letters
    --convert-umlauts: Convert common ISO-8859-1 (Latin-1) umlauts (ä-ae, ö-oe, ü-ue, ß-ss)
    -a, --meta: include meta data.
    --meta_file file: Output file for meta data.
    -e, --email: Include email addresses.
    --email_file &lt;file&gt;: Output file for email addresses.
    --meta-temp-dir &lt;dir&gt;: The temporary directory used by exiftool when parsing files, default /tmp.
    -c, --count: Show the count for each word found.
    -v, --verbose: Verbose.
    --debug: Extra debug information.

    Authentication
    --auth_type: Digest or basic.
    --auth_user: Authentication username.
    --auth_pass: Authentication password.

    Proxy Support
    --proxy_host: Proxy host.
    --proxy_port: Proxy port, default 8080.
    --proxy_username: Username for proxy, if required.
    --proxy_password: Password for proxy, if required.

    Headers
    --header, -H: In format name:value - can pass multiple.

&lt;url&gt;: The site to spider.</code></pre></div></div><p>Bash</p><h3 id="9vsdi" name="%E5%8F%82%E6%95%B0%E8%AE%B2%E8%A7%A3">参数讲解</h3><div class="table-wrapper"><table><thead><tr><th style="text-align:left"><div><div class="table-header"><p>-h、 --help</p></div></div></th><th style="text-align:left"><div><div class="table-header"><p>显示帮助</p></div></div></th></tr></thead><tbody><tr><td style="text-align:left"><div><div class="table-cell"><p>-k、 --keep</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>保留下载的文件</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>-d,--depth</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>探测网站路径的深度,默认值为2</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>-m、 --min_word_length</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>最小单词长度,默认值为3</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>-o、 --offsite</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>访问其他站点</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>--exclude</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>包含要排除的路径列表的文件</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>--allowed</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>路径必须匹配才能遵循的正则表达式模式</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>-w、 --write</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>将输出写入文件</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>-u、 --ua<agent></p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>要发送的用户代理</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>-n、 --no words</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>不输出单词列表</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>-g,--groups</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>同时返回单词组</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>--lowercase</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>小写所有已分析的单词</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>--with-numbers</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>接受带数字和字母的单词</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>--convert-umlauts</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>转换常见的ISO-8859-1(拉丁语-1)元音变音(ä-ae,ö-oe,ü-ue,ß-ss)</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>-a、 --meta</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>包括元数据</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>--meta_file file</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>元数据的输出文件</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>-e, --email</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>包括电子邮件地址</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>--email_file<file></p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>电子邮件地址的输出文件</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>--meta temp dir<dir></p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>exiftool在解析文件时使用的临时目录,默认为/tmp</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>-c、 --count</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>显示找到的每个单词的计数</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>-v、 --verbose</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>显示详细信息</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>--debug</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>额外的调试信息</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>--auth_type</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>摘要或基本</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>--auth_user</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>身份验证用户名</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>--auth_pass</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>身份验证密码</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>--proxy_host</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>代理主机</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>--proxy_port</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>代理端口,默认8080</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>--proxy_username</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>代理的用户名(如果需要)</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>--proxy_password</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>代理的密码(如果需要)</p></div></div></td></tr><tr><td style="text-align:left"><div><div class="table-cell"><p>--header,-H</p></div></div></td><td style="text-align:left"><div><div class="table-cell"><p>格式为name:value-可以传递多个</p></div></div></td></tr></tbody></table></div><h3 id="f7b01" name="%E5%AE%9E%E4%BE%8B">实例</h3><div class="rno-markdown-code"><div class="rno-markdown-code-toolbar"><div class="rno-markdown-code-toolbar-info"><div class="rno-markdown-code-toolbar-item is-type"><span class="is-m-hidden">代码语言:</span>javascript</div></div><div class="rno-markdown-code-toolbar-opt"><div class="rno-markdown-code-toolbar-copy"><i class="icon-copy"></i><span class="is-m-hidden">复制</span></div></div></div><div class="developer-code-block"><pre class="prism-token token line-numbers language-javascript"><code class="language-javascript" style="margin-left:0">┌──(root㉿kali)-[~]

└─# cewl -d 0 -m 8 http://onlinebooks.library.upenn.edu/
CeWL 5.5.2 (Grouping) Robin Wood (robin@digi.ninja) (https://digi.ninja/)
Listings
Everything
centered
February
Subjects
Everybody
Libraries
FEATURES
Celebration
ARCHIVES
Language
Specialty
Involved
Progress
Requested
Ockerbloom
onlinebooks
copyrights
licenses

Bash

结语

注意通过 -d 参数控制探测网站路径的深度。如果网站比较大,速度很慢,需要很长时间。