StackRating

An Elo-based rating system for Stack Overflow
Home   |   About   |   Stats and Analysis   |   Get a Badge
Rating Stats for

Sebastian Nagel

Rating
1513.74 (49,691st)
Reputation
494 (276,074th)
Page: 1 2
Title Δ
Number of records in WARC file 0.00
Nutch/Elastic Search terms definition 0.00
How to get webpage text from Common Crawl? -0.16
Streaming in a gzipped file from s3 in python 0.00
How to index crawled "html" from Apache Nutch to Solr? 0.00
I had some questions on db_redir_temp 0.00
How to retrieve the HTML of a page from CommonCrawl? 0.00
nutch I am reading the content folder from segments.There is differ... 0.00
Nutch http.redirect.max may I know what does it Mean 0.00
nutch job is failing Failed with exit value 255 0.00
I using rest api to get list of jobs running in nutch (nutch 1.17) 0.00
nutch fetch failed with protocol status: exception(16), lastModifie... 0.00
Nutch 1.17 web crawling with storage optimization 0.00
Fetch failed with protocol status: exception(16), lastModified=0: H... 0.00
I want to add the the raw content which is stored in segment folder... 0.00
Nutch Fetch failed with protocol status: moved(12), lastModified=0:... 0.00
How to add in nutch1.17 new urls in seed file will nutch fetch old... 0.00
Apache Nutch 1.17, Dump parsed content with some metadata into JSON +0.23
Nutch Selenium Interactive plugin ignores the chromedriver configur... 0.00
Storm-Crawler and Apache Strom 2.x.x 0.00
what encoding are files after being dumped by nutch? 0.00
Nutch urlflter regex -0.11
Nutch hadoop map reduce java heap space outOfMemory 0.00
Apache Nutch Crawler - Crawl new injected URLs in existing table only 0.00
Nutch segments disk space requirements grow fast 0.00
Transform one field into multiple fields in Solr 0.00
nutch 1.16 parsechecker issue with file:/directory/ inputs 0.00
nutch 1.16 skips file:/directory styled links in file system crawl 0.00
Using S3 as nutch storage system +4.43
Using Apache Solr to index Nutch data 0.00
Nutch 1.6: CSVIndexWriter fails 0.00
Nutch compatibility with Java 11 0.00
How to modify fetch interval of URLs in the crawldb? 0.00
Is it possible to read parquet INT64 timestamp on hive 2.1.1? 0.00
On WARC-Type of entries in StormCrawler WARC files 0.00
Getting No Urls to Fetch error on Nutch1.16 0.00
Nutch/Hadoop: How do I configure the url to track the job? 0.00
How to seed URLs as a text file in StormCrawler? -0.21
Nutch/Hadoop: regex-normalize.xml and regex-urlfilter.txt not found... 0.00
Can't crawl RDF Data with Apache Nutch 0.00
Nutch FetchData job is too slow 0.00
Nutch 1.x: How to use s3a instead of HDFS? 0.00
Provisioning EMR nodes with custom files -3.79
Is Stormcrawler v1.14 compatible with Elasticsearch 6.7.x? 0.00
Download small sample of AWS Common Crawl to local machine via http 0.00
why nutch index to a wrong solr collection even though set solr.ser... 0.00
Does commoncrawl contain only benign URLs? If yes, how they avoid i... 0.00
Id filed in SOLR is diferrent from URL when crawled by nutch for re... 0.00
Is it possible to get titles from the webversion of Common Crawler... 0.00
How to fix issue with nutch readseg not dumping any content? 0.00