Some of the columns of my TSV file contain sentences or paragraphs ("sentence file"".
Another of my TSV files is a dictionary of single or multi-word terms of interest ("dictionary file").
Can neo4j identify just the words/terms from the dictionary file that appear in the context of the sentences file?
If so:
A) is there a specific cypher query you can provide to get me started in the right direction? B) Is there anything special I need to do/prepare in either of the two TSV files to make this possible? (I don't think I could possibly provide a stopwords list that would help in this situation, by the way)
I am brand new to neo4j, so please explain like I'm five. ;)
Neo4j does have the ability to search a text string, such as your sentences, for a phrase (such as one of your dictionary words) using the string predicate ‘contains’. You could index the property that contained the sentences to improve search performance.
not sure what you mean to do exactly but if these are your sentence.tsv and dictionary.tsv files
sentence
test sentence
Kitty is a cat
It's sunny today
Adam has a cat
word
cat
sunny
you need to put both in the import directory of your db
then run this to import them as nodes
load csv WITH HEADERS from "file:///sentences.tsv" AS line FIELDTERMINATOR '\t'
create (:Sentence {text:line.sentence})
load csv WITH HEADERS from "file:///dictionary.tsv" AS line FIELDTERMINATOR '\t'
create (:Word {text:line.word})