Hi,
I want to run neo4j/text2cypher-gemma-2-9b-it-finetuned-2024v1 via the Hugging Face Inference API. When I tried to deploy the model as an Inference Endpoint, I received a warning stating that handler.py is missing.
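From the Inference Endpoints docs, I understand this means the repo needs a custom handler.py exposing an EndpointHandler class. This is the minimal sketch I would try (the interface is the documented one; the fp16/device_map settings and the response shape are my assumptions, not from the model card):

```python
# handler.py -- minimal custom handler sketch for an Inference Endpoint.
# Assumes `transformers` and `accelerate` are available on the endpoint image.
from typing import Any, Dict, List

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer


class EndpointHandler:
    def __init__(self, path: str = ""):
        # `path` points at the repository snapshot on the endpoint.
        self.tokenizer = AutoTokenizer.from_pretrained(path)
        self.model = AutoModelForCausalLM.from_pretrained(
            path,
            torch_dtype=torch.float16,  # T4s don't support bfloat16; fp16 halves memory vs. fp32
            device_map="auto",          # shard the 9B model across the 4 GPUs
        )
        self.model.eval()

    def __call__(self, data: Dict[str, Any]) -> List[Dict[str, Any]]:
        # Inference Endpoints pass the request body as {"inputs": ..., "parameters": {...}}.
        prompt = data["inputs"]
        params = data.get("parameters", {})
        inputs = self.tokenizer(prompt, return_tensors="pt").to(self.model.device)
        with torch.no_grad():
            output = self.model.generate(
                **inputs, max_new_tokens=params.get("max_new_tokens", 256)
            )
        text = self.tokenizer.decode(output[0], skip_special_tokens=True)
        return [{"generated_text": text}]
```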
I used my access token (approved for Google's gated Gemma models) and tried deploying with the following hardware configuration:
NVIDIA T4 (4 GPUs, 64 GB total VRAM)
46 vCPUs with 192 GB RAM
I encountered the following error:
[Previous line repeated 2 more times]
File "/usr/local/lib/python3.11/dist-packages/torch/nn/modules/module.py", line 804, in _apply
param_applied = fn(param)
^^^^^^^^^
File "/usr/local/lib/python3.11/dist-packages/torch/nn/modules/module.py", line 1159, in convert
return t.to(
^^^^^
torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 56.00 MiB. GPU
Application startup failed. Exiting.
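From what I can tell, each T4 has only 16 GB of VRAM, and a 9B-parameter model in fp32 needs roughly 36 GB (9B × 4 bytes), so I suspect the endpoint is loading the weights in full precision onto a single GPU instead of sharding them. Would loading in fp16 with device_map="auto" (as in the handler sketch above), or in 8-bit, avoid the OOM? A sketch of the 8-bit variant I have in mind (assuming bitsandbytes is installed; at ~1 byte per parameter the model should fit on a single 16 GB T4):

```python
# Sketch: 8-bit loading so the 9B model fits in one T4's 16 GB.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "neo4j/text2cypher-gemma-2-9b-it-finetuned-2024v1"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=BitsAndBytesConfig(load_in_8bit=True),
    device_map="auto",  # place quantized weights automatically
)
```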
How can I successfully deploy this model and use it for API inference?
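Ideally, once the endpoint is up, I would call it like this (a sketch; the endpoint URL and token are placeholders):

```python
import requests

API_URL = "https://<my-endpoint>.endpoints.huggingface.cloud"  # placeholder URL
headers = {"Authorization": "Bearer hf_..."}  # my access token

payload = {"inputs": "Generate a Cypher query: list all movies released after 2000."}
resp = requests.post(API_URL, headers=headers, json=payload)
print(resp.json())
```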
Thanks in advance!