
Closed
Posted
Paid on delivery
Engenheiro de IA Sênior - Otimização de Pipeline de Longo Contexto (Python) Descrição do Projeto: Estamos buscando um especialista em IA para destravar e otimizar nosso pipeline de análise de matrículas imobiliárias. O sistema é construído em Python e utiliza o Gemini 1.5 Pro/Flash para processar documentos que podem chegar a centenas de páginas. O foco é garantir que o modelo processe o contexto integral sem truncamento e com rigor lógico. Principais Desafios e Responsabilidades: * Sanitização e Ingestão: Refinar a extração de texto (OCR/Parsing) para evitar estouro de buffers e garantir que o texto completo chegue à IA via Python/MySQL (uso de LONGTEXT). * Gestão de Tokens e Cotas: Implementar estratégias de Rate Limiting e gerenciamento de Tokens per Minute (TPM) para evitar erros 429 nas APIs do Google Cloud. * Context Window Strategy: Aplicar técnicas de estruturação (XML Tagging/Chunking) para que a IA analise cadeias dominiais complexas em documentos de 100+ páginas sem se perder. * Prompt Engineering de Alta Fidelidade: Refinar instruções de sistema para garantir que a IA mantenha a "Auditoria Sequencial" e não resuma informações críticas. Requisitos Técnicos: * Domínio de Python: Experiência com bibliotecas de manipulação de dados e integração de APIs. * Google Cloud & Vertex AI: Experiência prática em gerenciar cotas e limites no Paid Tier do Google. * Arquitetura de Dados: Conhecimento em persistência de grandes volumes de texto (MySQL/Redis) e processamento assíncrono (Celery/Workers).
Project ID: 40210771
8 proposals
Remote project
Active 1 mo ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
8 freelancers are bidding on average $125 USD for this job

Having led and executed a multitude of complex projects centered around both the Python language as well as data manipulation and API integration, I am confident that my skills and experience align perfectly with the challenges your project presents. From my extensive knowledge of using OCR, parsing, and MySQL/Redis for extensive text storage, to successfully handling storage issues for large-scale entities, I believe I can contribute significantly in optimizing your pipeline. My proficiency in Python is well complemented by experience with various data management libraries and Google Cloud/Vertex AI services. Handling rate limiting API services to prevent session interruptions for clients is one of my key strengths. Through effective prompt engineering and establishing context window strategies, I have ensured valuable data isn't truncated nor lost amidst crucial long papers, something that's vital for your project as well. Moreover, I am distinguished by a proactive attitude towards value addition. Apart from delivering results with precision and efficiency, I add value by analyzing deeply for potential risks while continuously aligning myself with client's goals. It would be an absolute pleasure to bring this approach as well as my broader skillset to optimize your pipeline and deliver a rock-solid solution that meets your needs on time and within budget. Looking forward to taking it up!
$155 USD in 1 day
6.9
6.9

Dear Hiring Manager, I’m pleased to submit my proposal for the Senior AI Engineer role focused on long-context pipeline optimisation. Your challenge of reliably processing large-scale real estate registry documents with strict logical consistency strongly aligns with my experience in production-grade AI systems and LLM orchestration. My approach would focus on making the pipeline robust end to end—from ingestion to model reasoning—ensuring full-context integrity, predictable API behaviour, and auditable outputs even with documents exceeding hundreds of pages. ➡ Refinement of OCR and text-parsing pipelines to ensure clean, complete ingestion without buffer overflow or silent truncation ➡ Reliable handling of large text payloads in Python with proper use of MySQL LONGTEXT and supporting data stores ➡ Token and quota management strategies to prevent Google Cloud API 429 errors, including rate limiting and request orchestration ➡ Design of effective context-window strategies using structured chunking and XML-style tagging for long-chain document reasoning ➡ High-fidelity prompt engineering to enforce sequential audit logic and prevent loss or summarisation of critical legal information ➡ Hands-on experience integrating and operating Gemini models via Google Cloud / Vertex AI in paid-tier environments ➡ Strong Python background with data processing, API integration, and asynchronous architectures Best Regards, Mayank Saluja
$125 USD in 5 days
5.3
5.3

As an experienced Software Engineer, adept in AI and Python, I am well-positioned to unlock and optimize your complex real estate pipeline. Throughout my 9+ years in web and mobile app development, I have consistently harnessed the power of languages like Python, PHP, JavaScript and leveraged their respective libraries to build efficient systems. My familiarity with prominent data manipulation tools and APIs means I won't just sanitize and ingest text into your system effectively, but also implement robust strategies to manage rate limits via skills honed on platforms such as Google Cloud. Over the course of my career, I've gained significant proficiency in handling large volumes of text using MySQL/Redis, ensuring a reliable architecture of data persistence that aligns ideally with your project's demands. Additionally, my expertise in processing asynchronous tasks utilizing Celery/Workers highlights my suitability for your job. Furthermore, having worked extensively on User Interfaces for 100+ page documents across a gamut of industries, I can bring a fresh perspective to your pipeline-context window strategy as well. In conclusion, what sets me apart is not just my demonstrated skill-set but also my dedication to "auditing sequentially" and maintaining the holistic integrity of information: a quality that will undeniably elevate YOUR project's success quotient!
$130 USD in 7 days
5.5
5.5

Hi, there, Sou um Engenheiro de IA Sênior especializado em otimização de pipelines de análise de dados em Python. Com vasta experiência em sanitização de dados, gerenciamento de tokens, e prompt engineering, estou preparado para enfrentar os desafios do seu projeto. Como exemplo, otimizei com sucesso a ingestão de dados em projetos anteriores, garantindo a integridade dos textos. ✅ Utilizarei estratégias de Rate Limiting para evitar erros nas APIs do Google Cloud. ✅ Implementarei uma estratégia de Context Window com XML Tagging para análise de documentos extensos. ✅ Refinarei as instruções do sistema para manter a Auditoria Sequencial e evitar resumos de informações críticas. ✅ Gerenciarei eficientemente as cotas e limites no Google Cloud, priorizando a estabilidade do sistema. Estou animado para colaborar em seu projeto desafiador.
$145 USD in 1 day
4.2
4.2

Welcome to professional Python development services! Hi there, I'm Alema, a Python expert programmer who strives for clear code in atmospheric, numerical weather prediction, physics, and all other seminal fields. I'm ready to provide you with high-quality services. I have completed 350+ projects with a 100% Positive Rating. If you are looking for Quality work, look no further. Also, we are a team of professional workers, and we are always available 24/7 to help employers without limitations, and delivery is guaranteed on time. Your faithfully. Eng. Alema Akter
$15 USD in 1 day
2.5
2.5

Olá, Sr Sou um engenheiro de IA sênior com vasta experiência em otimização de pipelines de análise de contexto extenso em Python. Já desenvolvi projetos de processamento de documentos em larga escala, envolvendo sanitização de OCR, ingestão estruturada e persistência usando armazenamento LONGTEXT. Aplicarei estratégias de janela de contexto, como marcação XML e fragmentação inteligente, para preservar a lógica completa do documento em centenas de páginas. Caso surjam problemas de truncamento, sequenciamento ou raciocínio, diagnosticarei a causa raiz e aplicarei correções precisas no prompt e no pipeline. Minha ampla experiência inclui trabalho com Google Cloud, cotas do Vertex AI e processamento assíncrono usando workers e filas. Essa abordagem atenderá à sua necessidade de auditoria sequencial de alta fidelidade de documentos imobiliários complexos, sem perda de contexto crítico. Atenciosamente, Ahmed
$200 USD in 7 days
0.0
0.0

São Paulo, Brazil
Member since Oct 18, 2025
$30-250 USD
$15-25 CAD / hour
$10-30 USD
$30-250 CAD
$30-250 USD
₹1500-12500 INR
$15-25 USD / hour
₹12500-37500 INR
£250-750 GBP
$30-250 USD
₹12500-37500 INR
$10000-20000 CAD
$250-750 USD
€80 EUR
₹750-1250 INR / hour
£1500-3000 GBP
$750-1500 USD
₹75000-150000 INR
$250-750 USD
$250-750 USD
₹12500-37500 INR