Posts tagged ‘java’

Create a website broken links checker

This tutorial will show you how to extend Norconex HTTP Collector using Java to create a link checker to ensure all URLs in your web pages are valid. The link checker will crawl your target site(s) and create a report ... Read More...

Posted on February 10, 2015 in Latest Articles

Major upgrades to Norconex crawlers

Norconex just released major upgrades to all its Norconex Collectors and related projects. That is, Norconex HTTP Collector and Norconex Filesystem Collector, along with the Norconex Importer module and all available committers (Solr, Elasticsearch, HP IDOL, etc.), were all upgraded ... Read More...

Posted on November 27, 2014 in Latest Releases

Norconex Announces Availability of Norconex Filesystem Collector

GATINEAU, QC, CANADA – Thursday, August 25, 2014 – Norconex is announcing the launch of Norconex Filesystem Collector, providing organizations with a free “universal” filesystem crawler. The Norconex Filesystem Collector enables document indexing into target repositories of choice, such as ... Read More...

Posted on August 25, 2014 in Latest Releases

Norconex Importer 1.3.0 Released

Release 1.3.0 of Norconex Importer is now available. Release overview: Now stores the content “family” for each document as “importer.contentFamily”. New SplitTagger: splits values into multiple values using a separator of choice. New CopyTagger: copies document metadata fields to other fields. ... Read More...

Posted on August 19, 2014 in Latest Releases

Exploring Norconex Commons Lang

Norconex Commons Lang is a generic Java library providing useful utility classes that extend the base Java API. Its name is shamelessly borrowed from Apache Commons Lang, so people can quickly guess what it is about from the name alone. It ... Read More...

Posted on September 19, 2013 in Latest Articles
