GenAI Systems Security Evaluation Expert

Log In Sign Up
Design, implement, and execute test approaches to GenAI systems Chatbot) to identify security flaws, particularly those impacting confidentiality, integrity, or availability of information. Perform various types of tests such as functional testing, regression testing, performance testing, and usability testing to evaluate the behavior and performance of the AI algorithms and models. Create, implement, and execute test plans and strategies for evaluating AI systems, including defining test objectives, selecting suitable testing methods, and identifying test scenarios. Document test methods, results, and suggestions in clear and brief reports to stakeholders. Perform security assessments including creating updating and maintaining threat models and security integration of Gen AI platforms. Ensure that security design and controls are consistent with security architecture principals. Design security reference architectures and implement/configure security controls with an emphasis on GenAI technologies. Provide AI security architecture and design guidance as well as conduct full-stack architecture reviews of software for GenAI systems and platforms. Serve as a subject matter expert on information security for GenAI systems and applications in cloud/vendor and on-prem environments. Discuss AI/ML concepts proficiently with data science and ML teams to identify and develop solutions for security issues. Collaborate with engineering teams to perform advanced security analysis on complex GenAI systems, identifying gaps and contributing to design solutions and security requirements. Identify and document defects, irregularities or inconsistencies in AI systems and working closely with developers to rectify and resolve them. Ensure the quality, consistency and relevance of data used for training and testing AI models (includes collecting, preprocessing and validating data) Assess AI systems for ethical considerations and potential biases to make sure they follow ethical standards and encourage inclusivity and diversity. Collaborate with diverse teams including developers, data scientists, and domain experts to understand requirements validate assumptions and align testing efforts with project goals. Conducting research to identify vulnerabilities and potential failures in AI systems. Design and implement mitigations, detections, and protections to enhance the security and reliability of AI systems. Perform model input and output security including prompt injection and security assurance. Must have Skills: 12+ years of hands-on experience in GenAI and Cybersecurity or Information Security. Must have expertise Design, implement, and execute test approaches to GenAI systems (Chatbot) to identify security flaws, impacting confidentiality, integrity. Expert level experience to Create, implement, and execute test plans and strategies for evaluating AI systems 4+ years of experience programming with demonstrated advanced skills with Python and the standard ML stack (TensorFlow/Torch, NumPy, Pandas, etc.) 4+ years of experience with Natural Language Processing (NLP) and Large Language Models (LLM) desired 4+ years of experience working in Cloud environment (Azure, AWS, Google Cloud Platform) Demonstrated proficiency with AI/ML fundamental concepts and technologies including ML, Deep learning, NLP, and computer vision. Demonstrated expertise in attacking GenAI products and platforms. Demonstrated recent experience with large language models. Demonstrated experience with using AI testing frameworks and tools such as TensorFlow or PyTorch, or Keras Demonstrated ability to write test scripts, automate test cases, and analyze test results using programming languages and testing frameworks listed above. Demonstrated ability to Identify and document defects, irregularities or inconsistencies in AI systems and working closely with developers to rectify and resolve them. Ability to work independently to learn new technologies, methods, processes, frameworks/platforms, and systems. Excellent written and verbal communication skills to articulate challenging technical concepts to both lay and expert audiences. Ability to stay updated on the latest developments, trends, and best practices in both software testing and artificial intelligence. Bachelor s degree in computer science, electrical or computer engineering, statistics, econometrics, or related field, or equivalent work experience Desired Skills: Excellent problem-solving and critical thinking skills with attention to detail in an ever-changing environment. Background in designing and implementing security mitigations and protections and/or publications in the space Currently participating in CTF/GRT/AI Red Teaming events and/or bug bounties developing or contributing to OSS projects. Understanding of ML lifecycle and MLOps. Perform various types of tests such as functional testing, regression testing, performance testing, and usability testing to evaluate the behavior and performance of the AI algorithms and models Ability to ensure the quality, consistency and relevance of data used for training and testing AI models (includes collecting, preprocessing and validating data) Ability to assess AI systems for ethical considerations and potential biases to make sure they follow ethical standards and encourage inclusivity and diversity Ability work in and provide technical leadership to cross-functional teams to develop and implement AI/ML solutions, including capabilities that leverage LLM technology
Project ID: 39908243
16 proposals
Open for bidding
Remote project
Active 3 mins ago Bid amount Email address
Set your budget and timeframe
It’s free to sign up and bid on jobs
16 freelancers are bidding on average $14 USD for this job
Hello, With over 7 years of experience in Machine Learning (ML) and Data Science, I have carefully reviewed the requirements for the GenAI Systems Security Evaluation Expert project. To address the project needs, I propose to design and implement comprehensive test approaches for GenAI systems, including functional, regression, performance, and usability testing. I will create detailed test plans and strategies, documenting results in clear reports. Additionally, I will conduct security assessments, update threat models, and integrate security controls to ensure confidentiality, integrity, and availability of information. Furthermore, I will collaborate with engineering teams to analyze complex systems, identify vulnerabilities, and propose solutions. I will also ensure the quality of data used for training AI models and assess systems for ethical considerations and biases. My expertise in AI/ML concepts, programming skills in Python, and experience with NLP and Cloud environments make me well-equipped for this project. I would appreciate the opportunity to discuss this project further in a chat to provide a tailored solution. You can visit my profile at https://www.freelancer.com/u/HiraMahmood4072 Thank you.
$20 USD in 7 days
$10 USD in 30 days
$10 USD in 2 days
Hi Mate , Good evening! I’ve carefully checked your requirements and really interested in this job. I’m full stack node.js developer working at large-scale apps as a lead developer with U.S. and European teams. I’m offering best quality and highest performance at lowest price. I can complete your project on time and your will experience great satisfaction with me. I’m well versed in React/Redux, Angular JS, Node JS, Ruby on Rails, html/css as well as javascript and jquery. I have rich experienced in Usability Testing, Cloud Computing, Google Cloud Platform, Machine Learning (ML), Data Science, Azure, Regression Testing, Test Automation, Natural Language Processing and Large Language Models (LLMs). For more information about me, please refer to my portfolios. I’m ready to discuss your project and start immediately. Looking forward to hearing you back and discussing all details.. Looking forward to serve you
$10 USD in 4 days
Hello GenAI Systems Security Evaluation Expert, I am excited to submit a proposal for your project requiring expertise in designing, implementing, and executing test approaches for GenAI systems, particularly focusing on security flaws affecting confidentiality, integrity, and availability of information. My approach will involve conducting various tests like functional testing, regression testing, performance testing, and usability testing to evaluate AI algorithms and models thoroughly. I will create and execute comprehensive test plans, document results in clear reports, and collaborate with diverse teams to align testing efforts with project goals. Moreover, I will ensure the quality and relevance of data used for training and testing AI models, assess systems for ethical considerations and biases, and design security architectures with an emphasis on GenAI technologies. Can you please provide more details on your timeline and budget expectations for this project? I propose a budget range of $10,000 – $15,000 for this project with an estimated duration of 4-6 weeks. Looking forward to the opportunity to work on this exciting project with you. Thanks, John
$25 USD in 3 days
Hi Yamini375, I understand that you’re looking for a professional to design and implement comprehensive testing approaches for GenAI systems, specifically focusing on identifying security flaws that impact the confidentiality, integrity, and availability of information. My extensive experience enables me to effectively execute functional, regression, and performance testing while documenting findings for stakeholders. I am Syed, and I bring over 10 years of hands-on experience in Cloud Computing, Test Automation, and Cybersecurity, particularly within AI systems. My expertise spans Natural Language Processing, Machine Learning, and conducting thorough security assessments, ensuring compliance with best practices and ethical standards in AI. Please feel free to review my portfolio here: https://www.freelancer.com/u/syeds273 I look forward to the opportunity to discuss how I can contribute to the success of your project. Thanks, Regard, Syed
$10 USD in 1 day
Hi there, How are you? My name is Brandon and I am from United States. I’ve read your brief of GenAI Systems Security Evaluation Expert and I’m confident I can deliver exactly what you need—on time and on budget. With 7+ years of full-stack freelance experience, I’ve built, designed, written, automated, and optimised everything from one-page sites to multi-platform SaaS, e-commerce stores, ML pipelines, brand identities, and 7-figure ad campaigns. Also I am familiar with your requirements Cloud Computing, Data Science, Natural Language Processing, Regression Testing, Machine Learning (ML), Azure, Usability Testing, Google Cloud Platform, Test Automation and Large Language Models (LLMs). What you get if you choose me: 1. Zero hand-holding: I ask the right questions up-front, propose improvements, and keep you updated daily. 2. Clean, scalable, well-documented deliverables—whether that’s code, copy, graphics, datasets, or ad creatives. 3. Milestone-based workflow: you see real progress every 24–48 h, pay only when satisfied. 4. Post-delivery support: 30 days free bug fixes / tweaks, lifetime email support. Next step: message me the brief or any files, I’ll reply with a bullet-proof execution plan + fixed price/ETA within 30 minutes. If my proposal beats your expectations, award and we start immediately; if not, you owe nothing. Ready to knock this out—let’s talk. Best, Brandon Rodriguez
$10 USD in 3 days
$21 USD in 7 days
$10 USD in 30 days
Hello, I am interested in the GenAI Security and Testing role. I have more than twelve years of experience working in cybersecurity and AI, focusing on testing and securing large language models, NLP systems, and GenAI chatbots. My work involves designing and implementing complete testing strategies for AI systems, including functional, regression, performance, and usability testing. I also perform advanced security assessments to identify vulnerabilities that may affect the confidentiality, integrity, or availability of information in GenAI platforms. I have hands-on experience developing and executing test plans for AI models using Python, TensorFlow, PyTorch, and related frameworks. My background includes evaluating model performance, detecting prompt injection and data poisoning threats, and ensuring the quality and ethical integrity of datasets used for training and testing. I also design and maintain threat models and secure architectures for AI systems deployed in AWS, Azure, and Google Cloud environments. I would be happy to discuss how I can contribute to your project and provide a detailed testing and security plan tailored to your GenAI systems. Let’s chat!
$10 USD in 1 day
$10 USD in 30 days
$10 USD in 30 days
Hi there! I understand how critical it is to ensure GenAI systems are secure, reliable, and ethically sound while identifying vulnerabilities that could impact confidentiality, integrity, or availability. You need expert-level testing, assessment, and mitigation to protect AI platforms from real-world security threats. I will design and execute comprehensive test strategies for your GenAI chatbots, perform functional, regression, and performance testing, and implement security controls aligned with best practices. I’ll also review AI architectures, assess bias and data quality, and provide actionable reports with mitigation plans for developers to follow. Do you want the evaluation focused on cloud-based, on-prem, or both deployment environments? Start /Open chat now.
$20 USD in 7 days
Hi there, This project perfectly aligns with my background in AI security, testing, and GenAI systems architecture. With over 7 years of hands-on experience in AI/ML and cybersecurity, I specialize in evaluating and hardening LLM-based platforms, ensuring confidentiality, integrity, and availability across both cloud and on-prem environments. Here’s what I’ll bring to your project: Design and execute comprehensive test plans for GenAI models (chatbots, LLMs, NLP systems). Perform functional, regression, performance, and security testing using frameworks like PyTorch, TensorFlow, and pytest. Conduct prompt-injection, data leakage, and adversarial testing to identify vulnerabilities and strengthen AI resilience. Build and maintain threat models, design security reference architectures, and ensure alignment with Azure, AWS, and GCP best practices. Provide clear, actionable reports detailing findings, mitigation strategies, and architectural recommendations. I focus on clean automation, deep AI model understanding, and practical security integration that scales. I’d love to discuss your GenAI testing scope and propose a strategic roadmap for securing your platform. Best regards,
$20 USD in 7 days
Queens, United States
Member since Oct 23, 2025
US (International) / English
Freelancer
About
Terms
Partners
Apps
Registered Users
Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2025 Freelancer Technology Pty Limited (ACN 142 189 759)