
Closed
Posted
Paid on delivery
I'm seeking an experienced AI LLM Consultant with a proven track record in developing and implementing Artificial Intelligence solutions based on Large Language Models (LLMs). The goal is to create a locally deployable LLM system (on-premise), PRE-TRAINED in the financial domain (Mistral, FinBERT, Bloom, FinGPT, BloombergGPT, ...), capable of learning from internal documentation (Intranet) and prioritizing this internal information. The environment is Windows Server 2019 with the ability to run virtual machines. Ensure the system operates fully offline, independent of any external cloud services. NO PLACEHOLDERS Responsibilities: - Evaluate and select the most suitable open-source LLM for a Windows server environment and specific financial needs (DeepSeek is preferred, then Mistral 7B). - Design and implement the architecture for on-premise LLM installation, ensuring complete independence from external cloud services or other AI LLM APIs. - Develop and configure the system for ingesting and indexing documentary sources from our Intranet. - Implement mechanisms to PRIORITIZE information from the Intranet over the LLM's pre-trained knowledge base (LoRA/QLoRA). At the end of project: - Provide technical support and training to our internal team for managing and further training the system. - Document the entire architecture and implementation processes. Essential Technical Requirements: - Familiarity with Windows Server environment and managing hardware resources for AI. - Deep understanding of Large Language Models (LLMs) and their architectures. - Proven experience with open-source LLMs (e.g., Llama, Falcon, Mistral, etc.). - Hands-on experience in installing and configuring on-premise LLMs on Windows servers. - Familiarity with fine-tuning techniques, RAG (Retrieval Augmented Generation), and continuous learning. Desirable Requirements: - Previous experience in AI projects within the financial sector. ----- Would you like to bid on this project? ----- To best evaluate your application and technical approach, please create a document (WORD/PDF) addressing the following points: - Minimum Hardware Estimation: Provide a detailed estimate of the minimum hardware requirements (CPU, RAM, GPU, storage) for a Windows server capable of hosting the proposed LLM system, considering both training and inference needs with our internal documentation. - Required Software: List the essential software components (operating systems, runtimes, libraries, specific tools) that would be necessary to implement and operate the on-premise LLM system. - Proposed Architecture Design: Present a preliminary design of the architecture to be implemented, specifying the main components (logical units, databases, applications, virtual machines, etc.). Please note that considerations for High Availability (HA), Disaster Recovery (DR), or Load Balancing are not required. - how improve answer accracy and reduce allucinations. - A detailed activity plan (day/task) as part of your proposal. ----- How will it be used? ----- Mainly via API/WebService: a web application (ASP.NET) will collect requests from the connected user and request answers from the AI service. However, a web interface (such as ChatGPT, Claude, ...) is required to make impromptu requests. ----- Costs and quality ----- Please pay attention on environment performances. Please, no ask me «what is your budget for this project?» Bonuses provided at the end of the project for compliance with the timing and quality of the software. No upfront. No payment before successful completion of all tests. Unnecessary images, files, libraries, ... must be removed. ----- Collaboration ----- The consulting engagement will commence in March and will be conducted full-remote. Participation in daily update meetings is mandatory and non-negotiable. Failure to adhere to this requirement will result in immediate project disengagement, without exception.
Project ID: 40255370
281 proposals
Remote project
Active 28 days ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
281 freelancers are bidding on average €549 EUR for this job

⭐⭐⭐⭐⭐ Build an On-Premise LLM System for Financial Insights ❇️ Hi My Friend, I hope you are doing well. I reviewed your project requirements and see you are looking for an AI LLM Consultant. You don’t need to look any further; Zohaib is here to help you! My team has successfully completed over 50 similar projects in AI solutions. I will evaluate and select the best open-source LLM for your Windows Server environment and financial needs, ensuring it learns from your internal documentation. ➡️ Why Me? I can easily create your on-premise LLM system as I have 5 years of experience in AI and LLMs. My skills include working with various LLM architectures, installing on-premise systems, and fine-tuning models for specific applications. I also have a strong grip on managing Windows Server environments and optimizing performance for AI tasks. ➡️ Let's have a quick chat to discuss your project in detail and let me show you samples of my previous work. I look forward to chatting with you! ➡️ Skills & Experience: ✅ AI Solutions ✅ LLM Development ✅ Windows Server Management ✅ Open-Source LLMs ✅ System Architecture Design ✅ Fine-Tuning Techniques ✅ API Integration ✅ Data Ingestion ✅ Indexing Systems ✅ Technical Training ✅ Documentation Skills ✅ Problem Solving Waiting for your response! Best Regards, Zohaib
€350 EUR in 2 days
7.8
7.8

⭐⭐⭐⭐⭐ We are ready to bid and deliver a fully on-premise financial LLM platform. CnELIndia, led by Raman Ladhani, will: Evaluate DeepSeek and Mistral 7B (finance-tuned variants such as FinGPT/FinBERT) against your datasets and select the optimal model for Windows Server 2019 with GPU passthrough. Design an isolated VM-based architecture: LLM VM (GPU), RAG/Index VM, API VM (.NET bridge), and secure storage. Implement RAG with vector DB (FAISS/Qdrant), prioritized Intranet ingestion, and LoRA/QLoRA fine-tuning to weight internal knowledge over base pretraining. Deliver REST API for ASP.NET and a secure web chat UI. Minimum hardware (baseline): Dual Xeon/EPYC 16+ cores, 128GB RAM, 1× NVIDIA A100 40GB or RTX 6000 Ada (min 24GB VRAM), 4TB NVMe SSD. Software: Windows Server 2019, Hyper-V, Ubuntu VM, CUDA, PyTorch, Transformers, LangChain, FAISS/Qdrant, .NET 6+, IIS. Plan (4–6 weeks): Week 1: Assessment & hardware validation. Week 2: Environment & base LLM deployment. Week 3: RAG + ingestion pipeline. Week 4: LoRA fine-tuning & prioritization. Week 5: API, UI, performance tuning. Week 6: Documentation, training, testing & handover. Daily updates guaranteed. Full documentation, zero cloud dependency, performance-optimized delivery.
€500 EUR in 7 days
7.5
7.5

Hi, Your project for deploying a locally hosted AI LLM tailored for financial data analysis on a Windows Server environment shows a deep understanding of the need for data security and domain-specific intelligence. With extensive experience in AI, particularly with open-source LLMs such as Mistral and fine-tuning techniques like LoRA, I’m confident in architecting a solution that fully operates offline and prioritizes your intranet data. I will evaluate suitable models, design the architecture for Windows Server 2019 leveraging virtual machines for isolation, implement document ingestion and prioritization via RAG methods, and ensure seamless API integration for your ASP.NET frontend. I'll provide comprehensive documentation and training to your team. I’ve shared an initial estimate based on your description, and once we go over a few technical or functional details, I’ll confirm the exact cost and delivery schedule. The next step is to review your current server specs and data volume to tailor hardware needs and fine-tuning approach. Could you share details about the volume and format of your internal financial documents, and your current server hardware specifications? Thanks, Asad
€250 EUR in 10 days
6.9
6.9

Hello, I am an AI engineer with hands-on experience deploying open-source LLM systems fully on-premise, including financial-domain adaptation, RAG pipelines, and LoRA fine-tuning. I can design a Windows Server 2019–compatible architecture using DeepSeek (preferred) or Mistral 7B, optimized for local inference and controlled domain adaptation. My approach prioritizes Retrieval-Augmented Generation over full re-training, ensuring your Intranet knowledge base is ranked above base model knowledge without degrading model stability. Minimum Hardware Estimation: For reliable inference with a 7B quantized model, I recommend 1× NVIDIA RTX 4090 (24GB VRAM) or A6000 (48GB VRAM preferred for training), 128GB RAM, 16-core CPU, and NVMe SSD (2TB minimum). For LoRA fine-tuning, 48GB VRAM significantly improves efficiency Required Software: Windows Server 2019, WSL2 (Ubuntu), Docker, NVIDIA CUDA Toolkit, PyTorch, HuggingFace Transformers, vLLM or Text Generation Inference, LangChain or LlamaIndex for RAG, PostgreSQL or Qdrant for vector storage, and a lightweight web UI such as Open WebUI Information prioritization will be handled through RAG ranking strategies, metadata filtering, hybrid search (BM25 + embeddings), and optional LoRA adaptation trained on curated financial internal datasets. I am available full-remote starting March, can attend mandatory daily update meetings, and will ensure performance optimization and clean dependency management throughout the project. Thanks
€500 EUR in 7 days
6.5
6.5

As an accomplished software architect with a deep understanding of Large Language Models (LLMs) and their architectures, I believe I am the best fit for your AI LLM project. Throughout my 10+ years in the industry, I have successfully implemented numerous AI projects on Windows servers, ensuring maximum productivity while minimizing reliance on external resources. This expertise aligns perfectly with your requirements for a locally deployable, on-premise LLM system. Moreover, my familiarity with fine-tuning techniques, RAG and continuous learning further accentuate my suitability for your project. These skills will play a pivotal role in ingesting and prioritizing information from your Intranet – a necessary aspect for an efficient financial data analysis system. Furthermore, my experience with projects in the financial sector will add an extra level of precision to every step of our project. Lastly, as part of WellSpring Infotech, we are not merely focused on checklisting specifications - we are dedicated to cultivating partnerships. From meticulously estimating hardware needs to creating a detailed activity plan and providing training & documentation at the project's completion – you can count on a solid deliverable from us. Thanks....
€750 EUR in 7 days
6.7
6.7

✅ Lovable AI Expert | AI Development | LLM | Claude✅ Hi, Thank you for considering this opportunity! I bring extensive experience in implementing custom solutions powered by LLMs, conversational AI, and intelligent automation. Recently I have been working on Lovable AI for developing a gaming platform using it, complete with chat-based agent logic, expressive front-ends, and backend integrations. In other project, implemented a fully automated AI agent system for intelligent meeting creation using ElevenLabs Conversational AI and Gemini (via a custom agent brain). The flow integrates voice interaction, natural language processing, location precision, and frontend. Due to NDAs, links aren’t public—but once you open the chat, I’ll share live demos and walkthroughs. Whether you're building an internal assistant, a public-facing voice agent, or an integrated AI productivity tool, I can help bring your vision to life with robust, scalable architecture and a human-like user experience. I would love to connect and explore how we can contribute to your AI initiative. (Note: Budget is flexible — we can finalize it after reviewing the complete scope.) Thanks & Regards, Kajal
€750 EUR in 7 days
6.6
6.6

Hello, I specialize in on-premise LLM systems and built & customized large scale AI platforms for secure environments. The main challenge here is running a financial domain model fully offline on Windows Server while forcing it to prioritize your Intranet data over base knowledge. I am certified in Python and AI development and will solve this using DeepSeek or Mistral 7B with RAG + LoRA on Windows Server 2019, using PyTorch, HuggingFace, FAISS, and a secured REST API for your ASP.NET app. Minimum hardware: 1x RTX 4090 (24GB) or A6000, 128GB RAM, 16-core CPU, 2TB NVMe. Software: Windows Server 2019, WSL2 or VM Linux, CUDA, Python, Docker. Should documents auto-sync from Intranet? How many daily queries expected? Average document size? Need role-based answer control? Best regards, Dev S.
€1,000 EUR in 13 days
6.2
6.2

Hello, I understand you require a locally deployable, open-source LLM system for financial data analysis on a Windows Server 2019 environment, pre-trained on financial corpora and capable of prioritizing internal Intranet documentation. The system must operate fully on-premise with no cloud dependency, support API/WebService integration for ASP.NET applications, and include a web interface for ad-hoc queries. I have extensive experience designing and deploying on-premise LLMs, including DeepSeek, Mistral, and FinGPT, with custom fine-tuning, RAG pipelines, and LoRA/QLoRA approaches for domain prioritization. My methodology involves selecting the optimal LLM (DeepSeek or Mistral 7B) and designing a modular architecture including virtualized model servers, a document ingestion and embedding pipeline, vector database for retrieval, and API interface for web applications. I will configure LoRA/QLoRA fine-tuning to prioritize Intranet sources, implement inference optimization for low-latency responses, and provide complete documentation and internal team training. The deployment will be fully containerized/virtualized to simplify updates and maintenance. The final deliverables will include the fully deployed LLM system, configured API and web interface, optimized inference setup, documentation of architecture and processes, and internal training. The system will be production-ready, secure, and tailored to financial domain data. Thanks, Asif.
€750 EUR in 11 days
6.2
6.2

As a seasoned Senior Full Stack Developer and a Software Architect with over 6 years of experience, I've worked extensively on projects that are akin to the one you're offering. To delve more, I've had the privilege of working with Java, Python, .NET among many other languages you've listed, which would be valuable for managing hardware resources on the Windows Server. Furthermore, with an extensive background in Data Processing and Machine Learning technologies such as Deep Learning (including large language models), my expertise goes to the heart of what your project needs. In terms of evaluating and selecting the most suitable open-source LLM for your local Windows server environment and specific financial needs, I'm particularly familiar with systems such as DeepSeek and Mistral 7B which were among your preferences. Being well-versed in fine-tuning techniques like RAG and QLoRA, I can effectively prioritize the information from your Intranet over the LLM's base knowledge ensuring precise data selection. Additionally, my familiarity with AI projects within the financial sector will give me an extra edge while meeting your task's requirements.
€251 EUR in 4 days
6.2
6.2

I am an experienced AI consultant specializing in Large Language Models (LLMs) with a focus on financial solutions. My proven track record includes developing and implementing LLM systems like Mistral and FinBERT. With expertise in Node.js, React, and PHP, I am confident in designing and deploying an on-premise LLM system on Windows Server 2019. I am skilled in Excel automation and accounting software, ensuring seamless integration with your Intranet. My proposal includes a detailed hardware estimation, software requirements, architecture design, and activity plan. Let's work together to create a high-performance, offline LLM system for your financial needs.
€443 EUR in 7 days
6.1
6.1

Hello Sir, How would you like to see a powerful AI LLM solution tailored for your financial data needs, developed locally just for you? I specialize in designing independent, offline AI systems leveraging open-source LLMs optimized for financial environments and internal data prioritization. Let's connect to discuss how I can help create a robust LLM solution that meets your specifications. Best, Smith
€500 EUR in 7 days
6.3
6.3

Hi there, I’m excited about the opportunity to develop an on-premise LLM system for financial data analysis! With a robust background as a top California freelancer, I have successfully implemented AI solutions utilizing various LLMs including Mistral and FinBERT. My extensive experience in setting up local environments on Windows Server enables me to design an independent architecture that prioritizes your internal documentation while training the model. Understanding your need for a tailored AI solution, I am confident in my ability to evaluate and select the appropriate open-source LLMs, configure them effectively, and provide comprehensive training and documentation for your internal team. I can assure you that I will focus on achieving high performance while meeting your stringent requirements. Please message me to discuss this further, and I can provide a detailed activity plan along with preliminary hardware and software estimates. What specific internal sources will the LLM need to prioritize, and are there any expected challenges you foresee in the implementation?
€610 EUR in 4 days
5.8
5.8

Hello there, I specialize in transforming businesses using AI technologies - and this project seems like an incredible fit. My familiarity with Large Language Models (LLMs) and their architectures, such as the ones you've mentioned (Mistral, FinBERT, Bloom), coupled with my hands-on experience in installing and configuring on-premise LLMs on Windows servers strongly positions me to meet your needs. I have a deep understanding of fine-tuning techniques, which will be invaluable in ensuring the system operates fully offline thereby maintaining complete independence from any external cloud services or other AI LLM APIs. Having successfully developed similar solutions in the past, I'm confident in my ability to design and implement an architecture suitable for your organization's needs while ensuring that the system is capable of learning from your internal documentation (intranet) and prioritizing it over the LLM's pre-trained knowledge base using mechanisms like LoRA/QLoRA. Plus, to make it as easy as possible for your team to manage the system post-implementation, I'll provide comprehensive technical support and training while documenting the entire architecture for future reference. I look forward to your response to building a long-term relationship. Thanks !!
€500 EUR in 7 days
5.9
5.9

Hello, I specialize in on-premise LLM deployments for regulated environments and can design a fully self-hosted financial AI stack on Windows Server 2019 with zero cloud dependency, using DeepSeek or Mistral 7B combined with a secure RAG pipeline that prioritizes your Intranet data over base model knowledge via LoRA/QLoRA and vector indexing. I will deliver a clear hardware sizing plan (CPU/GPU/RAM/storage for both inference and fine-tuning), a precise software stack (CUDA, Ollama/vLLM, embeddings DB, orchestration layer), and a clean architecture blueprint covering VM layout, ingestion pipelines, API layer for ASP.NET integration, and an internal Chat UI — all documented with a structured day-by-day execution plan and team training at handover. I am comfortable with daily remote updates, performance optimization requirements, and milestone-based completion tied to successful validation testing. Best Regards, Arzoo Farooq
€670 EUR in 7 days
5.7
5.7

Hello Dear! I write to introduce myself. I'm Engineer Toriqul Islam. I was born and grew up in Bangladesh. I speak and write in English like native people. I am a B.S.C. Engineer of Computer Science & Engineering. I completed my graduation from Rajshahi University of Engineering & Technology ( RUET). I love to work on Web Design & Development project. Web Design & development: I am a full-stack web developer with more than 10 years of experience. My design Approach is Always Modern and simple, which attracts people towards it. I have built websites for a wide variety of industries. I have worked with a lot of companies and built astonishing websites. All Clients have good reviews about me. Client Satisfaction is my first Priority. Technologies We Use: Custom Websites Development Using ======>Full Stack Development. 1. HTML5 2. CSS3 3. Bootstrap4 4. jQuery 5. JavaScript 6. Angular JS 7. React JS 8. Node JS 9. WordPress 10. PHP 11. Ruby on Rails 12. MYSQL 13. Laravel 14. .Net 15. CodeIgniter 16. React Native 17. SQL / MySQL 18. Mobile app development 19. Python 20. MongoDB What you'll get? • Fully Responsive Website on All Devices • Reusable Components • Quick response • Clean, tested and documented code • Completely met deadlines and requirements • Clear communication You are cordially welcome to discuss your project. Thank You! Best Regards, Toriqul Islam
€250 EUR in 4 days
5.5
5.5

Greetings, Thank you for considering my application for this project. As an AI Engineer and Python Developer with over 8+ years of experience, I bring a wealth of knowledge and expertise in the field of Python, Deep Learning. I have carefully reviewed the project description and am eager to discuss your specific needs and requirements in more detail. My commitment is to provide dedicated support and consistent follow-up throughout the project's lifecycle. Please feel free to reach out to me to further discuss how I can contribute to the success of your project. Looking forward to the opportunity of working together. Best regards, KuroKien
€400 EUR in 1 day
5.6
5.6

Hello, I am an AI/ML Consultant with 11+ years of experience in deploying Large Language Models (LLMs) for financial and enterprise applications. I understand your requirement is to implement a fully on-premise, locally deployable LLM system on Windows Server 2019, pre-trained for financial data, capable of prioritizing internal documentation via LoRA/QLoRA, with API and web interface access for real-time queries. -->> Evaluate and deploy suitable open-source LLM (DeepSeek preferred, Mistral 7B as alternative) for Windows server -->> Design on-premise architecture with VM support, internal documentation ingestion, and indexing -->> Implement fine-tuning and RAG mechanisms to prioritize internal knowledge over pre-trained data -->> Provide API and web-based chat interface for impromptu and automated requests -->> Complete documentation, technical training, and post-deployment support for internal team Preliminary Technical Approach: Minimum Hardware Estimate: CPU: 16 cores, RAM: 128 GB, GPU: NVIDIA A100/4090-class for inference, Storage: 2 TB SSD for model + data indexing Required Software: Windows Server 2019, Python 3.11+, CUDA/cuDNN, PyTorch, Transformers, FAISS for vector search, Virtualization tools (Hyper-V) Architecture: LLM inference VM + document ingestion/indexing module + ASP.NET API layer + optional web UI for ad-hoc queries; internal DB (PostgreSQL/SQLite) for metadata Thanks & regards Julian
€375 EUR in 7 days
6.1
6.1

Hi there! I reviewed your project description carefully. Before clarifying all follow-up questions required, if I say my experience, I've already experienced deploying LLama2(7b/13b) models locally and implemented prompt engineering pipelines using hugging face and langchain as well as the training pipeline, so I would deliver the quality work tailored to your requirements. Please contact me so that we can discuss the details further. Thank you, Jijo
€500 EUR in 7 days
5.4
5.4

I am excited to formally submit my bid for this project. Choosing me ensures a partner with proven expertise in AI and LLM deployment—you will benefit from my extensive experience, including top-tier contributions in Bittensor subnets and advanced on-premise AI implementations. @Technical Approach@ Model Selection: I recommend DeepSeek (preferred) and Mistral 7B (quantized) on Windows Server 2019 with GPU acceleration. Financial domain specialization will be achieved via QLoRA fine-tuning. Architecture: A fully offline RAG-based solution comprising: Core LLM service (GPU VM) Vector database (FAISS or Milvus) Document ingestion & embedding pipeline API layer (REST for ASP.NET integration) Internal web interface for ad hoc queries Intranet Prioritization: Retrieval-first pipeline with strict grounding rules ensures internal documentation is prioritized. LoRA fine-tuning reinforces organizational terminology. Minimum Hardware (baseline): CPU 16–24 cores, RAM 128GB, GPU NVIDIA 48GB VRAM (A6000/L40), Storage 2TB NVMe SSD @Deliverables@ Full architecture documentation, installation scripts, team training, knowledge transfer, and performance validation. I am available for daily remote meetings from March, committed to milestone delivery, full offline compliance, and high-quality implementation. I look forward to driving this transformative financial AI initiative.
€350 EUR in 3 days
5.4
5.4

Hello, Your project is very well defined and aligns closely with the type of on-premise LLM deployments I work on, especially in secure environments where cloud access is not permitted. Building a fully offline financial-domain AI system on Windows Server using open-source models such as DeepSeek or Mistral 7B, combined with RAG and LoRA/QLoRA, is a solid approach to ensure that internal documentation is prioritized over general pretrained knowledge. My approach would focus on creating a modular architecture where the LLM inference engine, document ingestion pipeline, vector database, and API layer are clearly separated. Internal documents from your intranet would be processed through an embedding pipeline and indexed in a vector database, enabling Retrieval Augmented Generation so the model prioritizes internal information. Fine-tuning (LoRA/QLoRA) can further adapt the model to the financial context. The system will run fully offline, with optimized GPU utilization and controlled resource allocation within your Windows Server 2019 + VM environment. I will also provide the requested technical documentation, hardware estimation, architecture design, and a detailed day-by-day activity plan, along with training for your internal team to maintain and extend the system. Integration with your ASP.NET application via API/WebService will be straightforward, and I can also implement a secure internal chat interface for direct queries similar to ChatGPT.
€400 EUR in 7 days
6.2
6.2

Rome, Italy
Payment method verified
Member since Aug 24, 2018
€8-30 EUR
€8-30 EUR
€30-250 EUR
€8-30 EUR
€8-30 EUR
₹1500-12500 INR
₹12500-37500 INR
$2-8 USD / hour
$15-25 USD / hour
$250-750 USD
£250-750 GBP
£50-69 GBP / hour
₹12500-37500 INR
€250-750 EUR
$250-750 USD
₹1500-12500 INR
$2-8 USD / hour
£250-750 GBP
$250-750 USD
$30-250 SGD
₹1500-12500 INR
₹1500-12500 INR
$10-30 USD
$30-250 USD
$15-25 USD / hour