PixelCNN++: Improving the PixelCNN with discretized logistic mixture likelihood and other modifications
Enhancing PixelCNN with a discretized logistic mixture likelihood and other modifications
How to Build an Exchange
Building an exchange and managing user demand with upcoming talks
A brief trip through Spacetime
An introduction to using Spacetime, a new memory profiling facility for OCaml to detect space leaks and unwanted allocations.
Faulty reward functions in the wild
Understanding and mitigating challenges with faulty reward functions in real-world applications
Lessons in resilience at SoundCloud
Discussion on the circuit breaker pattern and client-side load balancing as essential aspects of resiliency in RPC at scale.
Universe
A software platform for measuring and training AI agents across games, websites, and other applications.
#Exploration: A study of count-based exploration for deep reinforcement learning
Investigating count-based exploration techniques in deep reinforcement learning
OpenAI and Microsoft
Exploring the collaboration between OpenAI and Microsoft in the realm of artificial intelligence.
On the quantitative analysis of decoder-based generative models
Exploring the quantitative analysis of decoder-based generative models
A connection between generative adversarial networks, inverse reinforcement learning, and energy-based models
Exploring the synergy of generative adversarial networks, inverse reinforcement learning, and energy-based models
RL²: Fast reinforcement learning via slow reinforcement learning
Optimizing reinforcement learning with the RL² approach
Variational lossy autoencoder
Exploring the variational lossy autoencoder for enhanced compression and representation learning.