|
Distribution |
Debian unstable |
Abteilung |
java |
Quelle |
hadoop |
Version |
0.20.2+dfsg1-2 |
Maintainer |
Debian Java Maintainers <pkg-java-maintainers@lists.alioth.debian.org>
|
Beschreibung |
Hadoop is a software platform for writing and running applications that process vast amounts of data on a distributed file system. . Here's what makes Hadoop especially useful: * Scalable: Hadoop can reliably store and process petabytes. * Economical: It distributes the data and processing across clusters of commonly available computers. These clusters can number into the thousands of nodes. * Efficient: By distributing the data, Hadoop can process it in parallel on the nodes where the data is located. This makes it extremely rapid. * Reliable: Hadoop automatically maintains multiple copies of data and automatically redeploys computing tasks based on failures. . The org.apache.hadoop.contrib.index.main.UpdateIndex library provides support for managing an index using MapReduce. A distributed "index" is partitioned into "shards", each corresponding to a Lucene instance. This library's main() method uses a MapReduce job to analyze documents and update Lucene instances in parallel.
|
Abhängig von | libhadoop-java (= 0.20.2+dfsg1-2), liblucene2-java |
Offizielle Seiten |
Paket
Entwicklerinformationen
Bugs (Binärpaket)
Bugs (Quellpaket) |
Download |
all |
|
|
|