Hello all,
I am creating a graph database for the 'New York Airbnb Open Data' dataset. After deleting the null columns and ensuring each field has the correct data type, I still get the following database error:
DatabaseError: {code: Neo.DatabaseError.Statement.ExecutionFailed} {message: At test.csv - Google Drive @ position 4771 - there's a field starting with a quote and whereas it ends that quote there seems to be characters in that field after that ending quote. That isn't supported. This is what I read: 'nQyAE":'}
The problem is that I cannot find any field starting with a quote in the dataset.
Regards,
Aditya
@adityachidre2005
What version of Neo4j?
How are you reading the CSV data included in the dataset? Is this via neo4j-admin database import, Cypher LOAD CSV, or something else?
CSV file format
The CSV file format and LOAD CSV interact as follows:
* The file character encoding must be UTF-8.
* The line terminator is system dependent (\n for Unix and \r\n for Windows).
* The default field delimiter is ,. Change it with the option FIELDTERMINATOR.
* CSV files may contain quoted STRING values, and the quotes are dropped when LOAD CSV reads the data.
* If dbms.import.csv.legacy_quote_escaping is set to the default value of true, \ is used as an escape character.
* A double quote must be in a quoted STRING and escaped, with either the escape character or a second double quote.
Note specifically the last rule above.
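The quoting rules above can be checked on a local copy of the file before running LOAD CSV. The following is only a minimal sketch, not Neo4j's actual parser: `find_unescaped_quotes` is a hypothetical helper that assumes a comma delimiter and a backslash escape character, and it reports only the case described in the error, namely a quoted field whose closing quote is followed by more characters.

```python
def find_unescaped_quotes(text, delimiter=",", escape="\\"):
    """Return 1-based (line, column) pairs where the quote that closes a
    quoted field is followed by something other than a delimiter or line end.

    Simplified illustration of the rules listed above; it is not Neo4j's parser.
    """
    problems = []
    line, col = 1, 0
    in_quotes = False
    at_field_start = True
    i, n = 0, len(text)
    while i < n:
        ch = text[i]
        col += 1
        if in_quotes:
            if ch == escape and i + 1 < n:
                # Escape character: skip the character it protects.
                i += 1
                col += 1
            elif ch == '"':
                if i + 1 < n and text[i + 1] == '"':
                    # Doubled quote inside a quoted field: skip the second one.
                    i += 1
                    col += 1
                else:
                    # Closing quote: only a delimiter or line end may follow.
                    in_quotes = False
                    nxt = text[i + 1] if i + 1 < n else ""
                    if nxt not in ("", delimiter, "\n", "\r"):
                        problems.append((line, col))
            elif ch == "\n":
                line, col = line + 1, 0
        else:
            if ch == '"' and at_field_start:
                in_quotes = True
            elif ch == "\n":
                line, col = line + 1, 0
            at_field_start = ch in (delimiter, "\n")
        i += 1
    return problems


# Made-up sample: row 3 closes its quote and then continues with ':x'.
sample = 'id,name\n1,"ok"\n2,"nQyAE":x\n3,"say ""hi"""\n'
print(find_unescaped_quotes(sample))
```

If this reports nothing for the local file while LOAD CSV still fails, the content Neo4j receives is probably not the same as the local file.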
I'm using Neo4j 5.27.0 and trying to import the CSV with a Cypher LOAD CSV WITH HEADERS FROM query.
@adityachidre2005
Per my prior update, the relevant rule is:
* A double quote must be in a quoted STRING and escaped, with either the escape character or a second double quote.
Yet the error references 'nQyAE":'
which does contain a double quote that is neither escaped nor followed by a second double quote. So the error is expected.
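To see what actually sits at the reported position, the content around that offset can be printed. This is a rough sketch: `context_at` is a hypothetical helper, and it assumes the position in the error message can be treated as a byte offset into the content, which may not be exactly how Neo4j counts it.

```python
def context_at(data: bytes, position: int, width: int = 40) -> str:
    """Return the text surrounding `position`, up to `width` bytes on each side."""
    start = max(0, position - width)
    end = min(len(data), position + width)
    # Undecodable bytes are replaced rather than raising an error.
    return data[start:end].decode("utf-8", errors="replace")


# In-memory demonstration with made-up content. For a real file, read it in
# binary mode, e.g. open(path, "rb").read(), and pass 4771 as the position.
demo = b"a" * 100 + b'"nQyAE":' + b"b" * 100
print(context_at(demo, 100, width=10))
```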
Thank you for your insights.
Yes, I understand that, but when I analyze the dataset, the field mentioned in the error is not there at all. I have tried some data preprocessing, but I still get the same error. Is there a way to handle this?
@adityachidre2005
Which CSV file are you reading with LOAD CSV? Did you download the CSV from New York Airbnb Open Data | Kaggle as a .zip and then extract it, or are you programmatically reading it via the Kaggle API?
Yes, I downloaded and extracted it. To load it I'm using a Google Drive link. The CSV files I'm working with are calendar.csv and reviews.csv.
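Because the file is loaded through a Google Drive link, and the error names its source as "test.csv - Google Drive", one possibility worth ruling out is that the link returns a web page rather than the raw CSV. The sketch below is only a heuristic: `looks_like_html` is a hypothetical helper, and the sample bytes are made up. It would be applied to whatever bytes the link returns, saved to a local file.

```python
def looks_like_html(payload: bytes) -> bool:
    """Heuristic: does the content start like an HTML page rather than CSV?"""
    head = payload[:1024]
    # Drop a UTF-8 byte order mark and leading whitespace before checking.
    if head.startswith(b"\xef\xbb\xbf"):
        head = head[3:]
    head = head.lstrip().lower()
    return head.startswith(b"<!doctype html") or head.startswith(b"<html")


# Made-up examples: an HTML page versus plain CSV text.
print(looks_like_html(b"<!DOCTYPE html><html><head></head></html>"))
print(looks_like_html(b"id,date\n1,2019-01-01\n"))
```

If the saved response does look like HTML, the quote reported in the error would come from the page markup and not from calendar.csv or reviews.csv.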