RAG on Mac (M3) with LangChain (RetrievalQA): code runs indefinitely
I'm trying to run a RAG system on my Mac M3 Pro (18 GB RAM) using LangChain and `Llama-3.2-3B-Instruct` in a Jupyter notebook, with Milvus as the vector store. When I invoke `RetrievalQA.from_chain_type`, the cell runs indefinitely (at least 15 minutes; I did not let it run longer).

    from langchain.chains import RetrievalQA
    qa_chain = RetrievalQA.from_chain_type(
        llm=llm, […]
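One way to narrow down a hang like this is to time the retrieval step and the generation step separately, each with a timeout, instead of running the whole chain at once. The following is only a rough sketch using the standard library: `time_stage`, `fake_retrieve`, and `fake_generate` are hypothetical names introduced here for illustration, and the two stand-in functions replace the real Milvus retriever and Llama model, which are not shown in the post.

```python
import time
from concurrent.futures import ThreadPoolExecutor, TimeoutError as FutureTimeout


def time_stage(name, fn, *args, timeout_s=60.0):
    """Run fn(*args) in a worker thread and report how long it took.

    Returns (status, elapsed_seconds, result), where status is "ok" or
    "timeout". On timeout the result is None.
    """
    pool = ThreadPoolExecutor(max_workers=1)
    start = time.perf_counter()
    future = pool.submit(fn, *args)
    try:
        result = future.result(timeout=timeout_s)
        status = "ok"
    except FutureTimeout:
        result, status = None, "timeout"
    finally:
        # Do not wait for a stuck worker. The thread cannot be killed,
        # so it keeps running in the background until fn returns.
        pool.shutdown(wait=False)
    elapsed = time.perf_counter() - start
    print(f"{name}: {status} after {elapsed:.2f}s")
    return status, elapsed, result


# Hypothetical stand-ins for the real retriever and LLM calls.
def fake_retrieve(query):
    return ["doc about " + query]


def fake_generate(prompt):
    time.sleep(0.5)  # simulates a slow model call
    return "answer"


if __name__ == "__main__":
    time_stage("retrieval", fake_retrieve, "milvus", timeout_s=5)
    time_stage("generation", fake_generate, "hello", timeout_s=0.1)
```

In the notebook, the stand-ins could be swapped for the real calls, for example `time_stage("generation", llm.invoke, "Hello")`, assuming a recent LangChain version in which the `llm` object exposes `invoke`. If the bare model call already times out, the problem is probably in how the model is loaded or run rather than in `RetrievalQA` or Milvus.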