Skip to content
View ElkinStas's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report ElkinStas

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ElkinStas/README.md

Stanislav Elkin

Senior AI/ML Engineer · Production LLM/RAG · NLP · AI Security · Agentic Systems

I build production-oriented AI systems that move beyond demos: retrieval workflows, LLM/RAG applications, NLP classifiers, evaluation pipelines, inference optimization, and agentic systems with measurable behavior.

My strongest areas are applied AI engineering, production NLP, retrieval-augmented systems, adversarial/deceptive-content detection, and reliable ML deployment.


Current Focus

  • Production LLM/RAG systems over heterogeneous documents and knowledge sources
  • Agentic AI workflows with controlled tool use, structured outputs, and evaluation
  • Scam, fraud, and social-engineering detection in messaging and communication data
  • Retrieval quality, reranking, prompt design, LLM evaluation, and release validation
  • AI security research: adversarial robustness and behavior-based detection

Technical Stack

Area Tools & Experience
LLM / RAG LangGraph, LangChain-style workflows, Anthropic Claude, OpenAI API, RAG evaluation, LLM-as-judge, prompt design
Retrieval & Search BM25, dense embeddings, reranking, FAISS, Elasticsearch, Milvus, Redis, hybrid retrieval
Machine Learning PyTorch, Hugging Face, BERT/T5, classifiers, detection systems, evaluation protocols
Agentic AI Multi-agent workflows, controlled tool invocation, graph memory, typed state, reasoning traces
Backend / Infra Python, FastAPI, Docker, CI/CD, pytest, REST APIs, async workflows
Optimization ONNX, TensorRT, Triton, inference latency and throughput optimization
Knowledge Graphs NetworkX, RDF/OWL, SPARQL, ontology design, entity extraction, relationship modeling
Data Pandas, NumPy, SQL, data cleaning, dataset construction, synthetic data, hard negatives

Selected Impact

  • Built production-oriented NLP/LLM systems for document intelligence, retrieval, and real-time deceptive-content detection.
  • Improved ML inference performance with Triton, ONNX, and TensorRT: 6× throughput improvement and around 40% lower latency.
  • Built a retrieval-augmented LLM agent for cybersecurity reasoning that improved task completion by 57.1% over a GPT-4o baseline on a CTF benchmark.
  • Published research on retrieval-augmented LLM agents for adversarial multi-step task automation at IEEE FLLM 2025.
  • Published work on NLP systems for production speech pipelines at Springer AIST 2024.
  • Mentored 12 engineers/students on production ML system design, deployment, monitoring, testing, and MLOps practices.

Featured Projects

Multi-agent cyber threat intelligence system that discovers cybersecurity incidents from the open web, extracts structured intelligence, enriches referenced CVEs through the NVD API, and serves consolidated JSON/Markdown reports via an async FastAPI service.

The pipeline is built as a LangGraph StateGraph with explicit agent-to-agent state transitions: search, extraction, post-processing, CVE enrichment, and report synthesis. The system includes Claude-based structured extraction, heuristic fallbacks, retry/backoff logic, per-node telemetry, caching, incident-to-CVE linkage, Docker setup, Streamlit UI, and a test suite that runs without network access or API keys.

Focus: LangGraph, FastAPI, Anthropic Claude, multi-agent workflows, structured outputs, CVE enrichment, telemetry, caching, Docker, Streamlit, pytest.


RL training task for teaching LLM agents to generate graphs that satisfy precise network topology constraints. The model must produce a connected 400-node graph matching target clustering coefficient, average path length, and connected-component constraints under limited tool-use and metric-call budgets.

What makes the task interesting is that there is no simple one-shot solution: the agent has to choose between graph generation strategies such as Erdős-Rényi, Barabási-Albert, and Holme-Kim triadic closure, then iteratively refine the graph through rewiring, triangle-boosting, and edge pruning. The benchmark is intentionally difficult because clustering and path length often trade off against each other, forcing the model to reason under constraints rather than simply generate code.

Focus: Python, NetworkX, reinforcement learning tasks, graph generation, constrained optimization, network metrics, tool-use budgets, LLM training evaluation.


Research codebase behind an IEEE FLLM 2025 paper on retrieval-augmented reasoning for offensive-security and CTF-style tasks. The project explores how an LLM agent can combine retrieval, graph-based memory, controlled tool routing, and step-by-step reasoning to solve multi-stage cybersecurity challenges more reliably than a raw LLM baseline.

The system uses an open-source LLM reasoning core, RAG over task-relevant data, NetworkX-based graph memory, and integrations with tools such as DirSearch, Nmap, SQLmap, Hashcat, and CyberChef. The dataset covers reconnaissance, forensics, cryptography, low-level exploitation, binary analysis, and reverse engineering tasks.

Focus: LLM agents, RAG, AI security, CTF automation, graph-based memory, tool routing, cybersecurity reasoning, IEEE FLLM 2025.


Publications


About Me

I’m based in Brazil and work well in remote, async, cross-functional environments. I enjoy roles where AI is not just a prototype, but a reliable product component: measurable, testable, useful, and maintainable.

I’m especially interested in applied AI products, AI security, retrieval systems, developer tools, document intelligence, knowledge systems, and agentic workflows.


Pinned Loading

  1. cti-multi-agent cti-multi-agent Public

    Multi-agent cyber threat intelligence system — discovers incidents from the open web, extracts structured intelligence via LLM, enriches CVEs through NVD, and serves consolidated reports via async …

    Python

  2. graph-generation-rl-task graph-generation-rl-task Public

    LLM agent training/evaluation task for graph generation with topology constraints, reward-style scoring, tool-use budgets, and iterative refinement.

    Python

  3. CTF_Assistant CTF_Assistant Public

    Forked from KirillAn/CTF_Assistant

    Research codebase for an IEEE FLLM 2025 paper: retrieval-augmented LLM agent for CTF challenge solving with RAG, graph memory, tool routing, and reasoning traces.

    Python