Efficient Modeling
Optimizing Model Architectures for Better Performance
My research in efficient modeling focuses on designing neural network architectures that deliver strong performance with reduced computational requirements. This work is crucial for deploying AI models in resource-constrained environments and making them more accessible.

Research Focus Areas
Model Compression
Developing techniques to reduce model size while maintaining performance (see the pruning sketch after this list), including:
- Pruning: Removing unnecessary connections or neurons
- Quantization: Reducing precision of weights and activations
- Knowledge Distillation: Transferring knowledge from larger models to smaller ones
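As a minimal illustration of the pruning idea, the PyTorch sketch below applies unstructured L1 (magnitude) pruning to a toy model. The layer sizes and the 30% sparsity level are arbitrary placeholders for demonstration, not settings from this research.

```python
import torch
import torch.nn as nn
import torch.nn.utils.prune as prune

# A toy model; the layer sizes are arbitrary, chosen only for illustration.
model = nn.Sequential(
    nn.Linear(128, 64),
    nn.ReLU(),
    nn.Linear(64, 10),
)

# Unstructured L1 (magnitude) pruning: zero out the 30% of weights
# with the smallest absolute value in the first linear layer.
prune.l1_unstructured(model[0], name="weight", amount=0.3)

# Pruning adds a weight_orig tensor plus a binary mask; fold the mask
# into the weights to make the pruning permanent.
prune.remove(model[0], "weight")

sparsity = (model[0].weight == 0).float().mean().item()
print(f"First-layer sparsity: {sparsity:.1%}")  # ~30.0%
```

Quantization can be sketched just as briefly: for example, torch.quantization.quantize_dynamic converts the linear layers of a trained model to int8 for smaller, faster inference.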
Architectural Innovations
Designing new model architectures that are inherently more efficient (see the attention sketch after this list):
- Attention Mechanisms: Reducing the quadratic cost of standard attention computations
- Sparse Networks: Leveraging sparsity for efficiency
- Neural Architecture Search: Automatically finding efficient architectures
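To make the attention bullet concrete, here is a small sketch of block-local (windowed) attention, one common way to cut the quadratic cost of full attention. The function, shapes, and window size are illustrative assumptions rather than an architecture from this work, and the sketch ignores masking and block overlap for brevity.

```python
import torch

def windowed_attention(q, k, v, window: int):
    """Local attention: each position attends only to the tokens in its
    own fixed-size block, cutting cost from O(n^2) to O(n * window).
    Shapes are (batch, heads, seq, dim); this sketch assumes seq is
    divisible by window."""
    b, h, n, d = q.shape
    assert n % window == 0, "sketch assumes seq divisible by window"
    # Fold the sequence into non-overlapping blocks of length `window`.
    q = q.reshape(b, h, n // window, window, d)
    k = k.reshape(b, h, n // window, window, d)
    v = v.reshape(b, h, n // window, window, d)
    # Standard scaled dot-product attention, restricted to each block.
    scores = q @ k.transpose(-2, -1) / d**0.5
    out = scores.softmax(dim=-1) @ v
    return out.reshape(b, h, n, d)

q = k = v = torch.randn(2, 4, 256, 32)
print(windowed_attention(q, k, v, window=64).shape)  # (2, 4, 256, 32)
```

Restricting each token to a fixed window replaces one n×n score matrix with n/window independent window×window blocks, which is the basic idea behind many sparse-attention designs.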
Training Efficiency
Improving the training process itself (see the training-loop sketch after this list):
- Curriculum Learning: Training on progressively harder examples
- Mixed Precision Training: Using lower precision during training
- Gradient Accumulation: Accumulating gradients over several micro-batches to simulate larger effective batch sizes
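The sketch below combines the last two items, mixed precision and gradient accumulation, in a generic PyTorch training loop. The model, data, and hyperparameters are placeholders; autocast and GradScaler only take effect on a CUDA device, and the loop falls back to full precision on CPU.

```python
import torch
import torch.nn as nn

device = "cuda" if torch.cuda.is_available() else "cpu"
model = nn.Linear(128, 10).to(device)           # placeholder model
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)
scaler = torch.cuda.amp.GradScaler(enabled=(device == "cuda"))
accum_steps = 4  # accumulate gradients over 4 micro-batches

for step in range(16):
    x = torch.randn(32, 128, device=device)     # placeholder batch
    y = torch.randint(0, 10, (32,), device=device)

    # Mixed precision: run the forward pass in lower precision where safe.
    with torch.autocast(device_type=device, enabled=(device == "cuda")):
        loss = nn.functional.cross_entropy(model(x), y)

    # Gradient accumulation: scale the loss so accumulated gradients
    # match one large batch, and step only every accum_steps micro-batches.
    scaler.scale(loss / accum_steps).backward()
    if (step + 1) % accum_steps == 0:
        scaler.step(optimizer)
        scaler.update()
        optimizer.zero_grad(set_to_none=True)
```

Dividing the loss by accum_steps keeps gradient magnitudes consistent with a single large batch, so the effective batch size here is 32 × 4 = 128 while peak memory stays at the 32-sample level.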

Impact
This research has significant implications for:
- Mobile and Edge Computing: Enabling AI models to run on devices with limited resources
- Real-time Applications: Reducing inference time for time-sensitive applications
- Environmental Impact: Lowering the carbon footprint of AI training and inference
- Accessibility: Making advanced AI capabilities available to more users and organizations
The work contributes to both theoretical understanding of efficient neural networks and practical applications in Adobe’s creative tools, where efficiency is crucial for providing responsive user experiences.