Server Header Code
Header | Value |
---|---|
Status | HTTP/1.1 301 Moved Permanently |
Connection | close |
Content-Length | 0 |
Server | Varnish |
Retry-After | 0 |
Location | https://nutch.apache.org/ |
Accept-Ranges | bytes |
Date | Fri, 29 Mar 2024 07:00:22 GMT |
Via | 1.1 varnish |
X-Served-By | cache-lon4224-LON |
X-Cache | HIT |
X-Cache-Hits | 0 |
X-Timer | S1711695623.627026,VS0,VE0 |
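The `X-Timer` value can be cross-checked against the `Date` header. The sketch below assumes the convention commonly documented for Fastly-style Varnish caches: `S` is the request start as a Unix timestamp, and `VS` / `VE` are start and end offsets in milliseconds. The function name `parse_x_timer` is hypothetical and the field meanings are an assumption, not something the report states.

```python
from datetime import datetime, timezone


def parse_x_timer(value: str) -> dict:
    """Split an X-Timer value into its fields (assumed semantics, see lead-in)."""
    fields = {}
    for part in value.split(","):
        part = part.strip()
        if part.startswith("VS"):
            fields["vs_ms"] = int(part[2:])   # assumed: start offset in ms
        elif part.startswith("VE"):
            fields["ve_ms"] = int(part[2:])   # assumed: end offset in ms
        elif part.startswith("S"):
            # assumed: request start as seconds since the Unix epoch
            fields["start"] = datetime.fromtimestamp(float(part[1:]), tz=timezone.utc)
    return fields


timer = parse_x_timer("S1711695623.627026,VS0,VE0")
print(timer["start"].isoformat(), timer["vs_ms"], timer["ve_ms"])
```

Under these assumptions the start time decodes to 07:00:23 UTC on 29 Mar 2024, within a second of the `Date` header, and `VE0` suggests the redirect was answered at the cache edge without measurable delay.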
One Word
Total: 31
Density | Keyword | Repeat | Headings | Link |
---|---|---|---|---|
7.50% | apache | 15 | 1 | 6 |
5.50% | nutch | 11 | 1 | 2 |
3.50% | 0 | 7 | 0 | 0 |
2.00% | highly | 4 | 0 | 0 |
2.00% | data | 4 | 0 | 0 |
1.50% | extensible | 3 | 1 | 0 |
1.50% | foundation | 3 | 0 | 1 |
1.50% | software | 3 | 0 | 1 |
1.50% | scalable | 3 | 1 | 0 |
1.00% | download | 2 | 0 | 2 |
1.00% | hit | 2 | 0 | 0 |
1.00% | varnish | 2 | 0 | 0 |
1.00% | e | 2 | 0 | 0 |
1.00% | logo | 2 | 0 | 0 |
1.00% | indexing | 2 | 0 | 1 |
1.00% | 1 | 2 | 0 | 0 |
1.00% | html | 2 | 0 | 1 |
1.00% | gmt | 2 | 0 | 0 |
1.00% | accomodates | 2 | 0 | 0 |
1.00% | configuration | 2 | 0 | 0 |
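The density column above appears to be the repeat count divided by the total number of counted words. The report does not state that total; the value 200 used below is inferred from the first row (15 repeats at 7.50%) and is an assumption. The function name `keyword_density` is hypothetical.

```python
def keyword_density(repeats: int, total_words: int) -> str:
    """Return repeats as a percentage of total_words, formatted like the report."""
    if total_words <= 0:
        raise ValueError("total_words must be positive")
    return f"{100 * repeats / total_words:.2f}%"


# Inferred total: 15 repeats / 7.50% = 200 words (an assumption, not stated in the report).
TOTAL_WORDS = 200

for keyword, repeats in [("apache", 15), ("nutch", 11), ("highly", 4), ("download", 2)]:
    print(keyword, keyword_density(repeats, TOTAL_WORDS))
```

With that inferred total, the computed values match the rows for `apache`, `nutch`, `highly`, and `download`.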
Two Word Phrases
Total: 25
Density | Keyword | Repeat | Headings | Link |
---|---|---|---|---|
3.02% | apache nutch | 6 | 1 | 2 |
1.51% | 1 1 | 3 | 0 | 0 |
1.51% | apache software | 3 | 0 | 1 |
1.51% | nutch apache | 3 | 0 | 1 |
1.01% | crawler enables | 2 | 0 | 0 |
1.01% | 1 varnish | 2 | 0 | 0 |
1.01% | software foundation | 2 | 0 | 1 |
1.01% | nutch highly | 2 | 0 | 0 |
1.01% | nutch nutch | 2 | 0 | 0 |
1.01% | fine grained | 2 | 0 | 0 |
1.01% | extensible highly | 2 | 0 | 0 |
1.01% | scalable matured | 2 | 0 | 0 |
1.01% | production-ready web | 2 | 0 | 0 |
1.01% | configuration accomodates | 2 | 0 | 0 |
1.01% | acquisition tasks | 2 | 0 | 0 |
1.01% | variety data | 2 | 0 | 0 |
1.01% | grained configuration | 2 | 0 | 0 |
1.01% | accomodates wide | 2 | 0 | 0 |
1.01% | web crawler | 2 | 0 | 1 |
1.01% | enables fine | 2 | 0 | 0 |
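Two-word phrase counts like those above can be produced by sliding a window over the token list. The sketch below is one plausible way to do it, not the tool's actual method; `ngram_density` and the sample text are hypothetical. A list of N words yields N - 1 two-word phrases, which is consistent with the table's densities if the page had about 199 phrases (6 / 199 ≈ 3.02%).

```python
from collections import Counter


def ngram_density(words: list, n: int) -> dict:
    """Map each n-word phrase to (repeat count, density in percent)."""
    grams = [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]
    total = len(grams)
    counts = Counter(grams)
    return {gram: (count, round(100 * count / total, 2)) for gram, count in counts.items()}


# Hypothetical sample text, not taken from the analysed page.
sample = "apache nutch web crawler apache nutch".split()
result = ngram_density(sample, 2)
print(result["apache nutch"])
```

Stop-word removal, which the table's phrases such as "variety data" suggest the tool applies, is left out of this sketch.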
Geolocation
IP Address | Country | ISO Code |
---|---|---|
151.101.2.132 | United States | US |
Link Ratio
Internal Links | External Links | Internal Link Percentage |
---|---|---|
10 | 12 | 45.45% |
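The internal link percentage is the share of internal links among all links counted. A minimal sketch of that arithmetic, with a hypothetical function name:

```python
def internal_link_percentage(internal: int, external: int) -> float:
    """Return internal links as a percentage of all links, rounded to two decimals."""
    total = internal + external
    if total == 0:
        return 0.0  # no links at all: report 0 rather than divide by zero
    return round(100 * internal / total, 2)


print(internal_link_percentage(10, 12))
```

For the 10 internal and 12 external links reported here, this gives 45.45, matching the table.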
External Follow Links
Total: 12
Link | URL |
---|---|
Web crawler | https://en.wikipedia.org/wiki/Web_crawler |
View on Github | https://github.com/apache/nutch |
Get Started | https://cwiki.apache.org/confluence/display/NUTCH/... |
Apache Hadoop™ | https://hadoop.apache.org |
Apache Tika™ | https://tika.apache.org |
Apache Solr™ | https://solr.apache.org |
Elasticsearch | https://www.elastic.co/elastic-stack/ |
Parsers | https://ci-builds.apache.org/job/Nutch/job/Nutch-t... |
HTML Filtering | https://ci-builds.apache.org/job/Nutch/job/Nutch-t... |
Indexing | https://ci-builds.apache.org/job/Nutch/job/Nutch-t... |
Scoring | https://ci-builds.apache.org/job/Nutch/job/Nutch-t... |
kube Theme for Hugo | https://github.com/jeblister/kube |
Last check ran at 07:00:22 on Fri 29 Mar 2024 and took 0.38 seconds.