while trying import csv produced wikipedia data following error:
rolf@shogun:~$ neo4j-import --into wiki.db --id-type string --bad-tolerance 12998873 --skip-bad-relationships true --multiline-fields true --nodes:page entities2.csv --relationships:link links2.csv --stacktrace true --skip-duplicate-nodes true importing contents of these files wiki.db: nodes: :page /home/rolf/entities2.csv relationships: :link /home/rolf/links2.csv available memory: free machine memory: 25.75 gb max heap memory : 6.98 gb nodes [>:??-------------|*properties----------|node:7.63 mb---|label scan-----------|v:??------------]100k done in 485ms prepare node index [*resolve (2412 collisions):15.61 mb-----------------------------------------------------------] 90k done in 377ms calculate dense nodes [>:27.21 mb/|prepare---------------|*divide----------------------------------------------------] 4m done in 52s 534ms relationships [*>:136.04 mb/s---------------------|prepare(2)========================|propert|v:208.52 mb/s--] 7m done in 10s 453ms node --> relationship import error: nodelabelupdates must supplied in order of ascending node id java.lang.illegalargumentexception: nodelabelupdates must supplied in order of ascending node id @ org.neo4j.kernel.api.impl.index.lucenelabelscanwriter.write(lucenelabelscanwriter.java:72) @ org.neo4j.unsafe.impl.batchimport.updatenoderecordsstep.update(updatenoderecordsstep.java:81) @ org.neo4j.unsafe.impl.batchimport.updatenoderecordsstep.update(updatenoderecordsstep.java:38) @ org.neo4j.unsafe.impl.batchimport.updaterecordsstep.process(updaterecordsstep.java:65) @ org.neo4j.unsafe.impl.batchimport.updaterecordsstep.process(updaterecordsstep.java:39) @ org.neo4j.unsafe.impl.batchimport.staging.processorstep$4.run(processorstep.java:120) @ org.neo4j.unsafe.impl.batchimport.staging.processorstep$4.run(processorstep.java:102) @ org.neo4j.unsafe.impl.batchimport.executor.dynamictaskexecutor$processor.run(dynamictaskexecutor.java:237)
i've tried filtering out & , / still same error (was mentioned in similar question).
the relationships csv (links2.csv) contains references don't exist in entities2.csv since it's small segment of data.
i'm using neo4j 2.2.5
this known issue fixed in codebase, see https://github.com/neo4j/neo4j/commit/45520e329403e166743b0027e75f2f658019ceae. either wait 2.2.6 or next release in 2.3 branch (either milestone or rc). alternatively grab sources , build snapshot on own.
Comments
Post a Comment