PinnedPublished inTowards AIGradient Descent OptimizationGradient Descent Optimization AlgorithmsOct 18, 2022Oct 18, 2022
PinnedPublished inTDS ArchiveIntriguing Properties of Neural NetworksHow do Neural Nets Work?May 18, 2022May 18, 2022
PinnedPublished inTDS ArchiveQRNN: A Potential Competitor to the TransformerTraining Faster RNNs with Quasi-RNNOct 7, 20201Oct 7, 20201
PinnedPublished inTDS ArchiveA Comprehensive Guide to Generative Adversarial Networks (GANs)Generating Meaningful Data from NoiseMay 16, 2020May 16, 2020
Published inTDS ArchiveUnderstanding LoRA Part I: Exploring Intrinsic DimensionsEfficient fine-tuning techniques for Language ModelsOct 31, 20241Oct 31, 20241
Published inThe StartupMy 2-Time Google Foobar ExperienceHow I did at the Google Foobar Challenge, Twice!Jan 26, 20211Jan 26, 20211
Published inTDS ArchiveGPT-3 ExplainedUnderstanding Transformer-Based Self-Supervised ArchitecturesJan 12, 2021Jan 12, 2021
Published inTDS ArchiveDynamic Programming in RLTowards Training Better Reinforcement Learning AgentsDec 30, 2020Dec 30, 2020
Published inTDS ArchiveLongformer: The Long-Document TransformerUnderstanding Transformer-Based Self-Supervised ArchitecturesDec 1, 20201Dec 1, 20201
Published inTDS ArchiveOptimizing Model Training with TensorFlow ProfilerOptimizing GPU Performance with TensorFlowNov 13, 2020Nov 13, 2020