登录
首页 » Java » nutch-0.8

nutch-0.8

于 2007-07-18 发布 文件大小:445KB
0 235
下载积分: 1 下载次数: 32

代码说明:

  nutch-0.8刚出来不久的一个很好用的搜索引擎工具 nutch-0.8刚出来不久的一个很好用的搜索引(nutch-0.8 has just come out near a very good tool to use search engine nutch-0.8 has just come out soon with a good primer of english)

文件列表:

META-INF
........\MANIFEST.MF
nutch-default.xml
nutch-site.xml
org
...\apache
...\......\nutch
...\......\.....\analysis
...\......\.....\........\AnalyzerFactory.class
...\......\.....\........\CharStream.class
...\......\.....\........\CommonGrams$ArrayTokens.class
...\......\.....\........\CommonGrams$Filter.class
...\......\.....\........\CommonGrams.class
...\......\.....\........\FastCharStream.class
...\......\.....\........\NutchAnalysis$1.class
...\......\.....\........\NutchAnalysis$JJCalls.class
...\......\.....\........\NutchAnalysis$LookaheadSuccess.class
...\......\.....\........\NutchAnalysis.class
...\......\.....\........\NutchAnalysisConstants.class
...\......\.....\........\NutchAnalysisTokenManager.class
...\......\.....\........\NutchAnalyzer.class
...\......\.....\........\NutchDocumentAnalyzer$1.class
...\......\.....\........\NutchDocumentAnalyzer$AnchorAnalyzer.class
...\......\.....\........\NutchDocumentAnalyzer$AnchorFilter.class
...\......\.....\........\NutchDocumentAnalyzer$ContentAnalyzer.class
...\......\.....\........\NutchDocumentAnalyzer.class
...\......\.....\........\NutchDocumentTokenizer.class
...\......\.....\........\ParseException.class
...\......\.....\........\Token.class
...\......\.....\........\TokenManager.class
...\......\.....\........\TokenMgrError.class
...\......\.....\clustering
...\......\.....\..........\HitsCluster.class
...\......\.....\..........\OnlineClusterer$1.class
...\......\.....\..........\OnlineClusterer.class
...\......\.....\..........\OnlineClustererFactory.class
...\......\.....\crawl
...\......\.....\.....\Crawl.class
...\......\.....\.....\CrawlDatum$Comparator.class
...\......\.....\.....\CrawlDatum.class
...\......\.....\.....\CrawlDb.class
...\......\.....\.....\CrawlDbMerger$Merger.class
...\......\.....\.....\CrawlDbMerger.class
...\......\.....\.....\CrawlDbReader$CrawlDbDumpReducer.class
...\......\.....\.....\CrawlDbReader$CrawlDbStatMapper.class
...\......\.....\.....\CrawlDbReader$CrawlDbStatReducer.class
...\......\.....\.....\CrawlDbReader$CrawlDbTopNMapper.class
...\......\.....\.....\CrawlDbReader$CrawlDbTopNReducer.class
...\......\.....\.....\CrawlDbReader.class
...\......\.....\.....\CrawlDbReducer.class
...\......\.....\.....\Generator$HashComparator.class
...\......\.....\.....\Generator$Selector.class
...\......\.....\.....\Generator$SelectorEntry.class
...\......\.....\.....\Generator$SelectorInverseMapper.class
...\......\.....\.....\Generator.class
...\......\.....\.....\Injector$InjectMapper.class
...\......\.....\.....\Injector$InjectReducer.class
...\......\.....\.....\Injector.class
...\......\.....\.....\Inlink.class
...\......\.....\.....\Inlinks.class
...\......\.....\.....\LinkDb$1.class
...\......\.....\.....\LinkDb$2.class
...\......\.....\.....\LinkDb$Merger.class
...\......\.....\.....\LinkDb.class
...\......\.....\.....\LinkDbMerger.class
...\......\.....\.....\LinkDbReader.class
...\......\.....\.....\MapWritable$ClassIdEntry.class
...\......\.....\.....\MapWritable$KeyValueEntry.class
...\......\.....\.....\MapWritable.class
...\......\.....\.....\MD5Signature.class
...\......\.....\.....\PartitionUrlByHost.class
...\......\.....\.....\Signature.class
...\......\.....\.....\SignatureComparator.class
...\......\.....\.....\SignatureFactory.class
...\......\.....\.....\TextProfileSignature$1.class
...\......\.....\.....\TextProfileSignature$Token.class
...\......\.....\.....\TextProfileSignature$TokenComparator.class
...\......\.....\.....\TextProfileSignature.class
...\......\.....\fetcher
...\......\.....\.......\Fetcher$FetcherThread.class
...\......\.....\.......\Fetcher$InputFormat.class
...\......\.....\.......\Fetcher.class
...\......\.....\.......\FetcherOutput.class
...\......\.....\.......\FetcherOutputFormat$1.class
...\......\.....\.......\FetcherOutputFormat.class
...\......\.....\html
...\......\.....\....\Entities.class
...\......\.....\indexer
...\......\.....\.......\DeleteDuplicates$1.class
...\......\.....\.......\DeleteDuplicates$2.class
...\......\.....\.......\DeleteDuplicates$HashPartitioner.class
...\......\.....\.......\DeleteDuplicates$HashReducer.class
...\......\.....\.......\DeleteDuplicates$HashScore.class
...\......\.....\.......\DeleteDuplicates$IndexDoc.class
...\......\.....\.......\DeleteDuplicates$InputFormat.class
...\......\.....\.......\DeleteDuplicates.class
...\......\.....\.......\FsDirectory$1.class
...\......\.....\.......\FsDirectory$DfsIndexInput$Descriptor.class
...\......\.....\.......\FsDirectory$DfsIndexInput.class
...\......\.....\.......\FsDirectory$DfsIndexOutput.class

下载说明:请别用迅雷下载,失败请重下,重下不扣分!

发表评论

0 个回复

  • weibobee_OpenSrc
    新浪微博爬虫程序,小蜜蜂,新浪微博爬虫程序,小蜜蜂(Sina micro-blog crawler, small bee,Sina micro-blog crawler, small bee)
    2013-09-25 09:19:51下载
    积分:1
  • searchView
    说明:  基于路岑呢的搜索功能,可以检索并建立索引等等(Search function based on Lucen, can search and build index, etc.)
    2020-06-23 02:40:01下载
    积分:1
  • Chess
    基于剪枝技术的一字棋博弈系统,理解和掌握博弈树的启发式搜索过程,能够用某种程序语言建立一个简单的博弈系统(Pruning techniques based word chess game systems, understand and master the game tree heuristic search process, we can build a simple game system in some programming language)
    2015-12-20 15:56:45下载
    积分:1
  • lucene
    站内搜索lucene使用实例 (stations examples of the use of search lucene station examples of the use of search lucene)
    2007-07-03 17:17:00下载
    积分:1
  • STM32F10X数字电子时钟
    STM3210F10X数字电子时钟下载,有时钟的各个功能和作用,希望管理员能看在我一片学程序的赤诚的心,通过我的审核。 跪求文件啊亲,一定要让我通过。
    2022-01-27 09:45:34下载
    积分:1
  • Zernike-Moment
    关于泽尼克矩的应用于二维图形文件的搜索。(Zernike Moment)
    2015-05-11 11:40:18下载
    积分:1
  • htdig-3.1.6.tar
    比较大型的网络搜索引擎,C++实现,可惜只支持unix系统(relatively large network search engines, C realized, but unfortunately, only unix support system)
    2007-03-14 22:21:08下载
    积分:1
  • 048575
    百度搜索源码例程,程序结合易语言超文本浏览框支持库,提交URL搜索地址在百度进行搜索。(Baidu search code samples , combined with easy language program hypertext browsing box support library , submit URL address search Baidu search.)
    2016-01-04 15:04:09下载
    积分:1
  • python_sina_crawl
    新浪微博的爬虫程序。程序运行方式:保存所有代码后,打开Main.py,修改LoginName为你的新浪微博帐号,PassWord为你的密码。运行Main.py,程序会在当前目录下生成CrawledPages文件夹,并保存所有爬取到的文件在这个文件夹中。(Sina microblogging reptiles. Program operation: save all the code, open Main.py, modify LoginName for your Sina Weibo account, PassWord for your password. Run Main.py, the program will generate CrawledPages in the current directory folder and save all files to crawling in this folder.)
    2021-04-08 16:39:00下载
    积分:1
  • 4714
    搜索论坛最新主题搜例程,源码演示取论坛最新主题20贴,读取论坛帖子地址列表,使用正则搜索地址文本。(Search Latest Forum Posts search routines , source code demonstrate fetch Latest Forum Posts 20 , read forum posts address list , search for addresses using regular text .)
    2015-07-28 19:57:07下载
    积分:1
  • 696516资源总数
  • 106914会员总数
  • 0今日下载