
New posts in large-language-model

How to get the logits of the model with a text classification pipeline from HuggingFace?

Finetuning a LM vs prompt-engineering an LLM

Llama-2 7B-hf repeats context of question directly from input prompt, cuts off with newlines

Langchain OpenAI invoke - unexpected keyword argument "functions"

VertexAIException - list index out of range Error when calling Gemini-Pro API

Implementation (and working) differences between AutoModelForCausalLMWithValueHead vs AutoModelForCausalLM?

Why do I get the error "Unrecognized request argument supplied: functions" when using `functions` when calling Azure OpenAI GPT?

How to keep conversation context of multiple users separate for LLM chatbot coded in Python-Flask

LLM model is not loading into the GPU even after BLAS = 1, LlamaCpp, Langchain, Mistral 7b GGUF Model

litellm.BadRequestError: LLM Provider NOT provided. Pass in the LLM provider you are trying to call

Streaming ChatGPT's results with Flask and LangChain

Ollama - How to inject context or get model to answer based on context? [closed]

Langchain not returning full response expected from OpenAI model

How to compute sentence level perplexity from hugging face language models?

Translation invariance of Rotary Embedding