Skip to content

Commit

Permalink
Clarifications, readability
Browse files Browse the repository at this point in the history
  • Loading branch information
Kristian Aune committed Jul 7, 2023
1 parent aeace53 commit 7644c69
Show file tree
Hide file tree
Showing 2 changed files with 7 additions and 6 deletions.
6 changes: 4 additions & 2 deletions examples/embedding-service/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -19,8 +19,10 @@ which takes a string as an input and returns a vector embedding of that string.
3. Download the models:
```
mkdir -p src/main/application/embedder-models/e5-small-v2
curl -o src/main/application/embedder-models/e5-small-v2/model.onnx https://data.vespa.oath.cloud/onnx_models/e5-small-v2/model.onnx
curl -o src/main/application/embedder-models/e5-small-v2/tokenizer.json https://data.vespa.oath.cloud/onnx_models/e5-small-v2/tokenizer.json
curl -o src/main/application/embedder-models/e5-small-v2/model.onnx \
https://data.vespa.oath.cloud/onnx_models/e5-small-v2/model.onnx
curl -o src/main/application/embedder-models/e5-small-v2/tokenizer.json \
https://data.vespa.oath.cloud/onnx_models/e5-small-v2/tokenizer.json
```
4. Add a public certificate: ``vespa auth cert``
5. Compile and deploy the application: ``mvn install && vespa deploy --wait 600``
Expand Down
7 changes: 3 additions & 4 deletions examples/embedding-service/src/main/application/services.xml
Original file line number Diff line number Diff line change
Expand Up @@ -85,11 +85,10 @@
- Reference: https://docs.vespa.ai/en/reference/services.html
-->
<nodes>
<!--
At least 16Gb of memory is required, since even the smallest model consumes quite a lot of memory.
-->
<resources vcpu="4.0" memory="16Gb"/>
<node hostalias="node1"/>

<!-- Example of configuring more memory for larger models, using Vespa Cloud -->
<resources vcpu="4.0" memory="16Gb"/>
</nodes>
</container>

Expand Down

0 comments on commit 7644c69

Please sign in to comment.