AI/ML Engineer - Python/PyTorch

Arting Digital


Date: 13 hours ago
City: Hyderabad, Telangana
Contract type: Full time
Job Title - AI/ML Engineer

Experience - 6+ years

Location - Hyderabad, Vizag

Roles And Responsibilities

Primary Skills - Python, PyTorch, TensorFlow, LLM Models

  • Data Collection, Preprocessing, and Management :

Data Collection : Gather diverse datasets for training LLMs or other machine learning models. This may include text data from various sources like web scraping, databases, or APIs.

Data Preprocessing : Clean and preprocess raw text data (e.g., tokenization, stemming, lemmatization, removing stopwords) for NLP tasks using tools such as SpaCy, NLTK, or custom preprocessing pipelines.

Data Augmentation : Create synthetic data or perform augmentation techniques to enrich training datasets, particularly in scenarios where large labeled datasets are scarce.

Data Pipeline Development : Build automated and scalable data pipelines for continuous data ingestion, cleaning, and feeding into models.

  • Building and Training Machine Learning Models :

Model Selection and Design : Design and implement deep learning architectures for specific use cases (e.g., LLMs for NLP tasks like sentiment analysis, text summarization, question answering).

Model Development Using PyTorch and TensorFlow : PyTorch : Build and train custom neural networks using PyTorch, leveraging its dynamic computation graph and flexibility for research and experimentation.

TensorFlow : Implement scalable, production-ready models using TensorFlow (including TensorFlow Hub and Keras for high-level model building).

Training Large Models : Train large models like transformers (e.g., BERT, GPT, T5) using large-scale datasets. Efficiently handle high computational requirements for these models, potentially using cloud services (AWS, GCP) or GPUs.

Fine-Tuning Pre-trained Models : Fine-tune pre-trained models like BERT, GPT-3, or other LLMs on task-specific data to improve performance on downstream applications.

Model Evaluation : Use evaluation metrics like accuracy, F1 score, BLEU score (for text generation), or perplexity to assess model performance. Perform cross-validation and hyperparameter optimization.

  • Model Optimization and Scaling :

Hyperparameter Tuning : Experiment with hyperparameters (e.g., learning rates, batch sizes, number of layers, dropout rates) to enhance model performance and prevent overfitting.

Optimization : Use model optimization techniques such as quantization, pruning, and knowledge distillation to reduce the size and improve the inference speed of large models.

Distributed Training : Implement distributed training using PyTorch Distributed or TensorFlow's MirroredStrategy to train large models efficiently across multiple GPUs/TPUs.

  • Model Deployment and Integration :

Model Deployment : Deploy AI/ML models into production environments (e.g., AWS SageMaker, Google AI Platform) ensuring scalability, security, and robustness.

API Development : Build APIs or microservices for serving models, enabling real-time predictions or batch processing using frameworks like Flask, FastAPI, or TensorFlow Serving.

Model Monitoring : Implement monitoring systems to track the performance and accuracy of models in production. Detect model drift or degradation over time and retrain when necessary.

Scalability and Optimization : Ensure that the models can scale to handle large-scale inference workloads. Use TensorFlow Lite for edge devices or ONNX for cross-framework deployment.

  • Research and Experimentation :

Cutting-Edge Research : Stay up to date with the latest advancements in machine learning, especially in transformer models and NLP, and incorporate state-of-the-art techniques into your work.

Innovation : Experiment with novel approaches for improving model accuracy, efficiency, or generalization (e.g., new transformer variants, unsupervised pretraining techniques).

Contributing to Open Source : Contribute to or develop open-source projects that enhance machine learning tools, especially in the field of NLP and LLMs.

(ref:hirist.tech)

How to apply

To apply for this job you need to authorize on our website. If you don't have an account yet, please register.

Post a resume

Similar jobs

Corporate Accounting

Celanese, Hyderabad, Telangana
13 hours ago
Celanese Corporation is a global chemical leader in the production of differentiated chemistry solutions and specialty materials used in most major industries and consumer applications. Our businesses use the full breadth of Celanese's global chemistry, technology and commercial expertise to create value for our customers, employees, shareholders and the corporation. As we partner with our customers to solve their most...

BSP Reconciliation Operations Specialist

Franklin Templeton India, Hyderabad, Telangana
13 hours ago
Benefit Street Partners (BSP) is owned by Franklin Templeton, a diversified firm that spans asset management, wealth management, and fintech, giving us many ways to help investors make progress toward their goals. With clients in over 150 countries and offices on six continents, you’ll get exposed to different cultures, people, and business development happening around the world.Benefit Street Partners operates...

Senior Product Manager, Uber Direct Growth

Uber, Hyderabad, Telangana
17 hours ago
About The RoleAs a Senior Product Manager for B2B Partner Growth for Uber Direct , you will be building one of Uber's fastest growing independent lines of business that provides small to large enterprise businesses around the world with access to Uber's best in class logistics network so they can offer on-demand local delivery to their customers.You will be responsible...