Hugging Face Blog
Join the Hugging Face community for the latest AI updates, tutorials, and industry insights.
1.
Evaluating Audio Reasoning with Big Bench Audio
December 20, 2024 00:00:00
2.
Finally, a Replacement for BERT: Introducing ModernBERT
December 19, 2024 00:00:00
3.
Bamba: Inference-Efficient Hybrid Mamba2 Model
December 18, 2024 00:00:00
4.
Welcome the Falcon 3 Family of Open Models!
December 17, 2024 00:00:00
5.
Benchmarking Language Model Performance on 5th Gen Xeon at GCP
December 17, 2024 00:00:00
6.
Introducing the Synthetic Data Generator - Build Datasets with Natural Language
December 16, 2024 00:00:00
7.
LeMaterial: an open source initiative to accelerate materials discovery and research
December 10, 2024 00:00:00
8.
Hugging Face models in Amazon Bedrock
December 9, 2024 00:00:00
9.
Open Preference Dataset for Text-to-Image Generation by the 🤗 Community
December 9, 2024 00:00:00
10.
Welcome PaliGemma 2 - New vision language models by Google
December 5, 2024 00:00:00
11.
How good are LLMs at fixing their mistakes? A chatbot arena experiment with Keras and TPUs
December 5, 2024 00:00:00
12.
Rethinking LLM Evaluation with 3C3H: AraGen Benchmark and Leaderboard
December 4, 2024 00:00:00
13.
Investing in Performance: Fine-tune small models with LLM insights - a CFM case study
December 3, 2024 00:00:00
14.
Rearchitecting Hugging Face Uploads and Downloads
November 26, 2024 00:00:00
15.
SmolVLM - small yet mighty Vision Language Model
November 26, 2024 00:00:00
16.
You could have designed state of the art positional encoding
November 25, 2024 00:00:00
17.
Letting Large Models Debate: The First Multilingual LLM Debate Competition
November 20, 2024 00:00:00
18.
From Files to Chunks: Improving Hugging Face Storage Efficiency
November 20, 2024 00:00:00
19.
Faster Text Generation with Self-Speculative Decoding
November 20, 2024 00:00:00
20.
Introduction to the Open Leaderboard for Japanese LLMs
November 20, 2024 00:00:00
21.
Judge Arena: Benchmarking LLMs as Evaluators
November 19, 2024 00:00:00
22.
Open Source Developers Guide to the EU AI Act
December 2, 2024 00:00:00
23.
Share your open ML datasets on Hugging Face Hub!
November 12, 2024 00:00:00
24.
Hugging Face + PyCharm
November 5, 2024 00:00:00
25.
Argilla 2.4: Easily Build Fine-Tuning and Evaluation datasets on the Hub - No Code Required
November 4, 2024 00:00:00
26.
Universal Assisted Generation: Faster Decoding with Any Assistant Model
October 29, 2024 00:00:00
27.
Expert Support case study: Bolstering a RAG app with LLM-as-a-Judge
October 28, 2024 00:00:00
28.
Hugging Face Teams Up with Protect AI: Enhancing Model Security for the Community
October 22, 2024 00:00:00
29.
A Deepdive into Aya Expanse: Advancing the Frontier of Multilinguality
October 24, 2024 00:00:00
30.
Introducing SynthID Text
October 23, 2024 00:00:00
31.
Introducing HUGS - Scale your AI with Open Models
October 23, 2024 00:00:00
32.
CinePile 2.0 - making stronger datasets with adversarial refinement
October 23, 2024 00:00:00
33.
Transformers.js v3: WebGPU support, new models & tasks, and more…
October 22, 2024 00:00:00
34.
🧨 Diffusers welcomes Stable Diffusion 3.5 Large
October 22, 2024 00:00:00
35.
Releasing Outlines-core 0.1.0: structured generation in Rust and Python
October 22, 2024 00:00:00
36.
Deploying Speech-to-Speech on Hugging Face
October 22, 2024 00:00:00
37.
Llama 3.2 in Keras
October 21, 2024 00:00:00
38.
Fixing Gradient Accumulation
October 16, 2024 00:00:00
39.
Introducing the AMD 5th Gen EPYC™ CPU
October 10, 2024 00:00:00
40.
A Security Review of Gradio 5
October 10, 2024 00:00:00
41.
Welcome, Gradio 5
October 9, 2024 00:00:00
42.
Scaling AI-based Data Processing with Hugging Face + Dask
October 9, 2024 00:00:00
43.
Faster Assisted Generation with Dynamic Speculation
October 8, 2024 00:00:00
44.
Improving Parquet Dedupe on Hugging Face Hub
October 5, 2024 00:00:00
45.
Introducing the Open FinLLM Leaderboard
October 4, 2024 00:00:00
46.
A Short Summary of Chinese AI Global Expansion
October 3, 2024 00:00:00
47.
🇨🇿 BenCzechMark - Can your LLM Understand Czech?
October 1, 2024 00:00:00
48.
Converting Vertex-Colored Meshes to Textured Meshes
September 30, 2024 00:00:00
49.
Llama can now see and run on your device - welcome Llama 3.2
September 25, 2024 00:00:00
50.
FineVideo: behind the scenes
September 23, 2024 00:00:00
51.
Exploring the Daily Papers Page on Hugging Face
September 23, 2024 00:00:00
52.
Optimize and deploy models with Optimum-Intel and OpenVINO GenAI
September 20, 2024 00:00:00
53.
Fine-tuning LLMs to 1.58bit: extreme quantization made easy
September 18, 2024 00:00:00
54.
Introducing the SQL Console on Datasets
September 17, 2024 00:00:00
55.
Introducing Community Tools on HuggingChat
September 16, 2024 00:00:00
56.
Accelerate 1.0.0
September 13, 2024 00:00:00
57.
Hugging Face partners with TruffleHog to Scan for Secrets
September 4, 2024 00:00:00
58.
Scaling robotics datasets with video encoding
August 27, 2024 00:00:00
59.
The 5 Most Under-Rated Tools on Hugging Face
August 22, 2024 00:00:00
60.
Improving Hugging Face Training Efficiency Through Packing with Flash Attention
August 21, 2024 00:00:00
61.
Deploy Meta Llama 3.1 405B on Google Cloud Vertex AI
August 19, 2024 00:00:00
62.
A failed experiment: Infini-Attention, and why we should keep trying?
August 14, 2024 00:00:00
63.
Introduction to ggml
August 13, 2024 00:00:00
64.
Welcome FalconMamba: The first strong attention-free 7B model
August 12, 2024 00:00:00
65.
Tool Use, Unified
August 12, 2024 00:00:00
66.
XetHub is joining Hugging Face!
August 8, 2024 00:00:00
67.
2024 Security Feature Highlights
August 6, 2024 00:00:00
68.
Introducing TextImage Augmentation for Document Images
August 6, 2024 00:00:00
69.
Google releases Gemma 2 2B, ShieldGemma and Gemma Scope
July 31, 2024 00:00:00
70.
Memory-efficient Diffusion Transformers with Quanto and Diffusers
July 30, 2024 00:00:00
71.
Serverless Inference with Hugging Face and NVIDIA NIMs
July 29, 2024 00:00:00
72.
LAVE: Zero-shot VQA Evaluation on Docmatix with LLMs - Do We Still Need Fine-Tuning?
July 25, 2024 00:00:00
73.
Llama 3.1 - 405B, 70B & 8B with multilinguality and long context
July 23, 2024 00:00:00
74.
WWDC 24: Running Mistral 7B with Core ML
July 22, 2024 00:00:00
75.
Docmatix - a huge dataset for Document Visual Question Answering
July 18, 2024 00:00:00
76.
TGI Multi-LoRA: Deploy Once, Serve 30 Models
July 18, 2024 00:00:00
77.
SmolLM - blazingly fast and remarkably powerful
July 16, 2024 00:00:00
78.
How we leveraged distilabel to create an Argilla 2.0 Chatbot
July 16, 2024 00:00:00
79.
How NuminaMath Won the 1st AIMO Progress Prize
July 11, 2024 00:00:00
80.
Announcing New Hugging Face and KerasHub integration
July 10, 2024 00:00:00
81.
Experimenting with Automatic PII Detection on the Hub using Presidio
July 10, 2024 00:00:00
82.
Preference Optimization for Vision Language Models
July 10, 2024 00:00:00
83.
Google Cloud TPUs made available to Hugging Face users
July 9, 2024 00:00:00
84.
Banque des Territoires (CDC Group) x Polyconseil x Hugging Face: Enhancing a Major French Environmental Program with a Sovereign Data Solution
July 9, 2024 00:00:00
85.
Announcing New Dataset Search Features
July 8, 2024 00:00:00
86.
Accelerating Protein Language Model ProtST on Intel Gaudi 2
July 3, 2024 00:00:00
87.
Our Transformers Code Agent beats the GAIA benchmark!
July 1, 2024 00:00:00
88.
Welcome Gemma 2 - Google's new open LLM
June 27, 2024 00:00:00
89.
XLSCOUT Unveils ParaEmbed 2.0: a Powerful Embedding Model Tailored for Patents and IP with Expert Support from Hugging Face
June 25, 2024 00:00:00
90.
Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models
June 24, 2024 00:00:00
91.
Ethics and Society Newsletter #6: Building Better AI: The Importance of Data Quality
June 24, 2024 00:00:00
92.
Data Is Better Together: A Look Back and Forward
June 20, 2024 00:00:00
93.
Going multimodal: How Prezi is leveraging the Hub and the Expert Support Program to accelerate their ML roadmap
June 19, 2024 00:00:00
94.
BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks
June 18, 2024 00:00:00
95.
From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate
June 13, 2024 00:00:00
96.
🧨 Diffusers welcomes Stable Diffusion 3
June 12, 2024 00:00:00
97.
Putting RL back in RLHF
June 12, 2024 00:00:00
98.
Making sense of this mess
June 7, 2024 00:00:00
99.
Introducing the Hugging Face Embedding Container for Amazon SageMaker
June 7, 2024 00:00:00
100.
Launching the Artificial Analysis Text to Image Leaderboard & Arena
June 6, 2024 00:00:00
101.
Introducing NPC-Playground, a 3D playground to interact with LLM-powered NPCs
June 5, 2024 00:00:00
102.
Faster assisted generation support for Intel Gaudi
June 4, 2024 00:00:00
103.
Space secrets security update
May 31, 2024 00:00:00
104.
Benchmarking Text Generation Inference
May 29, 2024 00:00:00
105.
Training and Finetuning Embedding Models with Sentence Transformers v3
May 28, 2024 00:00:00
106.
Falcon 2: An 11B parameter pretrained language model and VLM, trained on over 5000B tokens and 11 languages
May 24, 2024 00:00:00
107.
CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models
May 24, 2024 00:00:00
108.
Unlocking Longer Generation with Key-Value Cache Quantization
May 16, 2024 00:00:00
109.
Deploy models on AWS Inferentia2 from Hugging Face
May 22, 2024 00:00:00
110.
Introducing Spaces Dev Mode for a seamless developer experience
May 21, 2024 00:00:00
111.
Build AI on premise with Dell Enterprise Hub
May 21, 2024 00:00:00
112.
Hugging Face on AMD Instinct MI300 GPU
May 21, 2024 00:00:00
113.
From cloud to developers: Hugging Face and Microsoft Deepen Collaboration
May 21, 2024 00:00:00
114.
PaliGemma - Google's Cutting-Edge Open Vision Language Model
May 14, 2024 00:00:00
115.
Hugging Face x LangChain : A new partner package in LangChain
May 14, 2024 00:00:00
116.
Introducing the Open Arabic LLM Leaderboard
May 14, 2024 00:00:00
117.
License to Call: Introducing Transformers Agents 2.0
May 13, 2024 00:00:00
118.
Subscribe to Enterprise Hub with your AWS Account
May 9, 2024 00:00:00
119.
Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon
May 9, 2024 00:00:00
120.
Introducing the Open Leaderboard for Hebrew LLMs!
May 5, 2024 00:00:00
121.
Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face
May 3, 2024 00:00:00
122.
Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints
May 1, 2024 00:00:00
123.
Improving Prompt Consistency with Structured Generations
April 30, 2024 00:00:00
124.
StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation
April 29, 2024 00:00:00
125.
Introducing the Open Chain of Thought Leaderboard
April 23, 2024 00:00:00
126.
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent
April 22, 2024 00:00:00
127.
Welcome Llama 3 - Meta's new open LLM
April 18, 2024 00:00:00
128.
The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare
April 19, 2024 00:00:00
129.
AI Apps in a Flash with Gradio's Reload Mode
April 16, 2024 00:00:00
130.
Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs
April 16, 2024 00:00:00
131.
Running Privacy-Preserving Inference on Hugging Face Endpoints
April 16, 2024 00:00:00
132.
Ryght's Journey to Empower Healthcare and Life Sciences with Expert Support from Hugging Face
April 16, 2024 00:00:00
133.
Introducing Idefics2: A Powerful 8B Vision-Language Model for the community
April 15, 2024 00:00:00
134.
Vision Language Models Explained
April 11, 2024 00:00:00
135.
Making thousands of open LLMs bloom in the Vertex AI Model Garden
April 10, 2024 00:00:00
136.
CodeGemma - an official Google release for code LLMs
April 9, 2024 00:00:00
137.
Hugging Face partners with Wiz Research to Improve AI Security
April 4, 2024 00:00:00
138.
Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B
April 4, 2024 00:00:00
139.
Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon
April 3, 2024 00:00:00
140.
Public Policy at Hugging Face
April 8, 2024 00:00:00
141.
Bringing serverless GPU inference to Hugging Face users
April 2, 2024 00:00:00
142.
Pollen-Vision: Unified interface for Zero-Shot vision models in robotics
March 25, 2024 00:00:00
143.
Total noob's intro to Hugging Face Transformers
March 22, 2024 00:00:00
144.
Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval
March 22, 2024 00:00:00
145.
Introducing the Chatbot Guardrails Arena
March 21, 2024 00:00:00
146.
A Chatbot on your Laptop: Phi-2 on Intel Meteor Lake
March 20, 2024 00:00:00
147.
Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models
March 20, 2024 00:00:00
148.
GaLore: Advancing Large Model Training on Consumer-grade Hardware
March 20, 2024 00:00:00
149.
Easily Train Models with H100 GPUs on NVIDIA DGX Cloud
March 18, 2024 00:00:00
150.
quanto: a pytorch quantization toolkit
March 18, 2024 00:00:00
151.
CPU Optimized Embeddings with 🤗 Optimum Intel and fastRAG
March 15, 2024 00:00:00
152.
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
March 15, 2024 00:00:00
153.
Introducing ConTextual: How well can your Multimodal model jointly reason over text and image in text-rich scenes?
March 5, 2024 00:00:00
154.
Data is better together
March 4, 2024 00:00:00
155.
Text-Generation Pipeline on Intel® Gaudi® 2 AI Accelerator
February 29, 2024 00:00:00
156.
StarCoder2 and The Stack v2
February 28, 2024 00:00:00
157.
TTS Arena: Benchmarking Text-to-Speech Models in the Wild
February 27, 2024 00:00:00
158.
AI Watermarking 101: Tools and Techniques
February 26, 2024 00:00:00
159.
Fine-Tuning Gemma Models in Hugging Face
February 23, 2024 00:00:00
160.
Introducing the Red-Teaming Resistance Leaderboard
February 23, 2024 00:00:00
161.
🪆 Introduction to Matryoshka Embedding Models
February 23, 2024 00:00:00
162.
Fetch Consolidates AI Tools and Saves 30% Development Time with Hugging Face on AWS
February 23, 2023 00:00:00
163.
Welcome Gemma - Google's new open LLM
February 21, 2024 00:00:00
164.
Introducing the Open Ko-LLM Leaderboard: Leading the Korean LLM Evaluation Ecosystem
February 20, 2024 00:00:00
165.
🤗 PEFT welcomes new merging methods
February 19, 2024 00:00:00
166.
Synthetic data: save money, time and carbon with open source
February 16, 2024 00:00:00
167.
AMD Pervasive AI Developer Contest!
February 14, 2024 00:00:00
168.
From OpenAI to Open LLMs with Messages API
February 8, 2024 00:00:00
169.
SegMoE: Segmind Mixture of Diffusion Experts
February 3, 2024 00:00:00
170.
NPHardEval Leaderboard: Unveiling the Reasoning Abilities of Large Language Models through Complexity Classes and Dynamic Updates
February 2, 2024 00:00:00
171.
Constitutional AI with Open LLMs
February 1, 2024 00:00:00
172.
Hugging Face Text Generation Inference available for AWS Inferentia2
February 1, 2024 00:00:00
173.
Patch Time Series Transformer in Hugging Face
February 1, 2024 00:00:00
174.
Introducing the Enterprise Scenarios Leaderboard: a Leaderboard for Real World Use Cases
January 31, 2024 00:00:00
175.
Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding
January 30, 2024 00:00:00
176.
The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models
January 29, 2024 00:00:00
177.
An Introduction to AI Secure LLM Safety Leaderboard
January 26, 2024 00:00:00
178.
Hugging Face and Google partner for open AI collaboration
January 25, 2024 00:00:00
179.
Open-source LLMs as LangChain Agents
January 24, 2024 00:00:00
180.
Fine-Tune W2V2-Bert for low-resource ASR with 🤗 Transformers
January 19, 2024 00:00:00
181.
PatchTSMixer in HuggingFace
January 19, 2024 00:00:00
182.
Preference Tuning LLMs with Direct Preference Optimization Methods
January 18, 2024 00:00:00
183.
Accelerating SD Turbo and SDXL Turbo Inference with ONNX Runtime and Olive
January 15, 2024 00:00:00
184.
A guide to setting up your own Hugging Face leaderboard: an end-to-end example with Vectara's hallucination leaderboard
January 12, 2024 00:00:00
185.
Faster fine-tuning using TRL & Unsloth
January 10, 2024 00:00:00
186.
Welcome aMUSEd: Efficient Text-to-Image Generation
January 4, 2024 00:00:00
187.
LoRA training scripts of the world, unite!
January 2, 2024 00:00:00
188.
Speculative Decoding for 2x Faster Whisper Inference
December 20, 2023 00:00:00
189.
2023, year of open LLMs
December 18, 2023 00:00:00
190.
Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face
December 11, 2023 00:00:00
191.
Mixture of Experts Explained
December 11, 2023 00:00:00
192.
AMD + 🤗: Large Language Models Out-of-the-Box Acceleration with AMD GPU
December 5, 2023 00:00:00
193.
SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit
December 6, 2023 00:00:00
194.
Optimum-NVIDIA - Unlock blazingly fast LLM inference in just 1 line of code
December 5, 2023 00:00:00
195.
Goodbye cold boot - how we made LoRA inference 300% faster
December 5, 2023 00:00:00
196.
Open LLM Leaderboard: DROP deep dive
December 1, 2023 00:00:00
197.
SDXL in 4 steps with Latent Consistency LoRAs
November 9, 2023 00:00:00
198.
Make your llama generation time fly with AWS Inferentia2
November 7, 2023 00:00:00
199.
Introducing Prodigy-HF: a direct integration with Hugging Face
November 7, 2023 00:00:00
200.
Comparing the Performance of LLMs: A Deep Dive into Roberta, Llama 2, and Mistral for Disaster Tweets Analysis with Lora
November 7, 2023 00:00:00
201.
Introducing Storage Regions on the HF Hub
November 3, 2023 00:00:00
202.
Personal Copilot: Train Your Own Coding Assistant
October 27, 2023 00:00:00
203.
Interactively explore your Huggingface dataset with one line of code
October 25, 2023 00:00:00
204.
Deploy Embedding Models with Hugging Face Inference Endpoints
October 24, 2023 00:00:00
205.
The N Implementation Details of RLHF with PPO
October 24, 2023 00:00:00
206.
Exploring simple optimizations for SDXL
October 24, 2023 00:00:00
207.
Gradio-Lite: Serverless Gradio Running Entirely in Your Browser
October 19, 2023 00:00:00
208.
Accelerating over 130,000 Hugging Face models with ONNX Runtime
October 4, 2023 00:00:00
209.
Accelerating Stable Diffusion XL Inference with JAX on Cloud TPU v5e
October 3, 2023 00:00:00
210.
Chat Templates: An End to the Silent Performance Killer
October 3, 2023 00:00:00
211.
Deploying the AI Comic Factory using the Inference API
October 2, 2023 00:00:00
212.
Ethics and Society Newsletter #5: Hugging Face Goes To Washington and Other Summer 2023 Musings
September 29, 2023 00:00:00
213.
Finetune Stable Diffusion Models with DDPO via TRL
September 29, 2023 00:00:00
214.
Non-engineers guide: Train a LLaMA 2 chatbot
September 28, 2023 00:00:00
215.
Llama 2 on Amazon SageMaker a Benchmark
September 26, 2023 00:00:00
216.
Inference for PROs
September 22, 2023 00:00:00
217.
Rocket Money x Hugging Face: Scaling Volatile ML Models in Production
September 19, 2023 00:00:00
218.
Introduction to 3D Gaussian Splatting
September 18, 2023 00:00:00
219.
Object Detection Leaderboard
September 18, 2023 00:00:00
220.
Optimizing your LLM in production
September 15, 2023 00:00:00
221.
Introducing Würstchen: Fast Diffusion for Image Generation
September 13, 2023 00:00:00
222.
Fine-tuning Llama 2 70B using PyTorch FSDP
September 13, 2023 00:00:00
223.
Overview of natively supported quantization schemes in 🤗 Transformers
September 12, 2023 00:00:00
224.
SafeCoder vs. Closed-source Code Assistants
September 11, 2023 00:00:00
225.
Efficient Controllable Generation for SDXL with T2I-Adapters
September 8, 2023 00:00:00
226.
Spread Your Wings: Falcon 180B is here
September 6, 2023 00:00:00
227.
Fetch Cuts ML Processing Latency by 50% Using Amazon SageMaker & Hugging Face
September 1, 2023 00:00:00
228.
AudioLDM 2, but faster ⚡️
August 30, 2023 00:00:00
229.
Code Llama: Llama 2 learns to code
August 25, 2023 00:00:00
230.
Deprecation of Git Authentication using password
August 25, 2023 00:00:00
231.
Making LLMs lighter with AutoGPTQ and transformers
August 23, 2023 00:00:00
232.
Introducing SafeCoder
August 22, 2023 00:00:00
233.
Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model
August 22, 2023 00:00:00
234.
Hugging Face Platform on the AWS Marketplace: Pay with your AWS Account
August 10, 2023 00:00:00
235.
Optimizing Bark using 🤗 Transformers
August 9, 2023 00:00:00
236.
Deploying Hugging Face Models with BentoML: DeepFloyd IF in Action
August 9, 2023 00:00:00
237.
Fine-tune Llama 2 with DPO
August 8, 2023 00:00:00
238.
Releasing Swift Transformers: Run On-Device LLMs in Apple Devices
August 8, 2023 00:00:00
239.
Deploy MusicGen in no time with Inference Endpoints
August 4, 2023 00:00:00
240.
Huggy Lingo: Using Machine Learning to Improve Language Metadata on the Hugging Face Hub
August 2, 2023 00:00:00
241.
Towards Encrypted Large Language Models with FHE
August 2, 2023 00:00:00
242.
Practical 3D Asset Generation: A Step-by-Step Guide
August 1, 2023 00:00:00
243.
Open-sourcing Knowledge Distillation Code and Weights of SD-Small and SD-Tiny
August 1, 2023 00:00:00
244.
Stable Diffusion XL on Mac with Advanced Core ML Quantization
July 27, 2023 00:00:00
245.
AI Policy @🤗: Open ML Considerations in the EU AI Act
July 24, 2023 00:00:00
246.
Introducing Agents.js: Give tools to your LLMs using JavaScript
July 24, 2023 00:00:00
247.
Results of the Open Source AI Game Jam
July 21, 2023 00:00:00
248.
Happy 1st anniversary 🤗 Diffusers!
July 20, 2023 00:00:00
249.
Llama 2 is here - get it on Hugging Face
July 18, 2023 00:00:00
250.
Building an AI WebTV
July 17, 2023 00:00:00
251.
Open-Source Text Generation & LLM Ecosystem at Hugging Face
July 17, 2023 00:00:00
252.
Fine-tuning Stable Diffusion models on Intel CPUs
July 14, 2023 00:00:00
253.
Making ML-powered web games with Transformers.js
July 5, 2023 00:00:00
254.
Deploy LLMs with Hugging Face Inference Endpoints
July 4, 2023 00:00:00
255.
Making a web app generator with open ML models
July 3, 2023 00:00:00
256.
Leveraging Hugging Face for complex generative AI use cases
July 1, 2023 00:00:00
257.
Accelerating Vision-Language Models: BridgeTower on Habana Gaudi2
June 29, 2023 00:00:00
258.
Ethics and Society Newsletter #4: Bias in Text-to-Image Models
June 26, 2023 00:00:00
259.
What's going on with the Open LLM Leaderboard?
June 23, 2023 00:00:00
260.
Panel on Hugging Face
June 22, 2023 00:00:00
261.
Fine-tuning MMS Adapter Models for Multi-Lingual ASR
June 19, 2023 00:00:00
262.
AI Policy @🤗: Response to the U.S. NTIA's Request for Comment on AI Accountability
June 20, 2023 00:00:00
263.
Yes, Transformers are Effective for Time Series Forecasting (+ Autoformer)
June 16, 2023 00:00:00
264.
Faster Stable Diffusion with Core ML on iPhone, iPad, and Mac
June 15, 2023 00:00:00
265.
Deploy Livebook notebooks as apps to Hugging Face Spaces
June 15, 2023 00:00:00
266.
Announcing our new Content Guidelines and Policy
June 15, 2023 00:00:00
267.
Hugging Face and AMD partner on accelerating state-of-the-art models for CPU and GPU platforms
June 13, 2023 00:00:00
268.
Can foundation models label data like humans?
June 12, 2023 00:00:00
269.
The Hugging Face Hub for Galleries, Libraries, Archives and Museums
June 12, 2023 00:00:00
270.
DuckDB: run SQL queries on 50,000+ datasets on the Hugging Face Hub
June 7, 2023 00:00:00
271.
Welcome fastText to the 🤗 Hub
June 6, 2023 00:00:00
272.
The Falcon has landed in the Hugging Face ecosystem
June 5, 2023 00:00:00
273.
AI Speech Recognition in Unity
June 2, 2023 00:00:00
274.
Announcing the Open Source AI Game Jam 🎮
June 1, 2023 00:00:00
275.
Hugging Face Selected for the French Data Protection Agency Enhanced Support Program
May 15, 2023 00:00:00
276.
Introducing the Hugging Face LLM Inference Container for Amazon SageMaker
May 31, 2023 00:00:00
277.
Introducing BERTopic Integration with Hugging Face Hub
May 31, 2023 00:00:00
278.
Optimizing Stable Diffusion for Intel CPUs with NNCF and 🤗 Optimum
May 25, 2023 00:00:00
279.
Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA
May 24, 2023 00:00:00
280.
Hugging Face Collaborates with Microsoft to Launch Hugging Face Model Catalog on Azure
May 24, 2023 00:00:00
281.
Hugging Face and IBM partner on watsonx.ai, the next-generation enterprise studio for AI builders
May 23, 2023 00:00:00
282.
Safetensors audited as really safe and becoming the default
May 23, 2023 00:00:00
283.
Instruction-tuning Stable Diffusion with InstructPix2Pix
May 23, 2023 00:00:00
284.
Large-scale Near-deduplication Behind BigCode
May 16, 2023 00:00:00
285.
Smaller is better: Q8-Chat, an efficient generative AI experience on Xeon
May 16, 2023 00:00:00
286.
Run a Chatgpt-like Chatbot on a Single GPU with ROCm
May 15, 2023 00:00:00
287.
Introducing RWKV - An RNN with the advantages of a transformer
May 15, 2023 00:00:00
288.
Assisted Generation: a new direction toward low-latency text generation
May 11, 2023 00:00:00
289.
Creating a Coding Assistant with StarCoder
May 9, 2023 00:00:00
290.
A Dive into Text-to-Video Models
May 8, 2023 00:00:00
291.
StarCoder: A State-of-the-Art LLM for Code
May 4, 2023 00:00:00
292.
How to Install and Use the Hugging Face Unity API
May 1, 2023 00:00:00
293.
Running IF with 🧨 diffusers on a Free Tier Google Colab
April 26, 2023 00:00:00
294.
Training a language model with 🤗 Transformers using TensorFlow and TPUs
April 27, 2023 00:00:00
295.
Databricks ❤️ Hugging Face: up to 40% faster training and tuning of Large Language Models
April 26, 2023 00:00:00
296.
Introducing HuggingFace blog for Chinese speakers: Fostering Collaboration with the Chinese AI community
April 24, 2023 00:00:00
297.
How to host a Unity game in a Space
April 21, 2023 00:00:00
298.
Accelerating Hugging Face Transformers with AWS Inferentia2
April 17, 2023 00:00:00
299.
Graph Classification with Transformers
April 14, 2023 00:00:00
300.
Creating Privacy Preserving AI with Substra
April 12, 2023 00:00:00
301.
Snorkel AI x Hugging Face: unlock foundation models for enterprises
April 6, 2023 00:00:00
302.
StackLLaMA: A hands-on guide to train LLaMA with RLHF
April 5, 2023 00:00:00
303.
Ethics and Society Newsletter #3: Ethical Openness at Hugging Face
March 30, 2023 00:00:00
304.
Fast Inference on Large Language Models: BLOOMZ on Habana Gaudi2 Accelerator
March 28, 2023 00:00:00
305.
Accelerating Stable Diffusion Inference on Intel CPUs
March 28, 2023 00:00:00
306.
Federated Learning using Hugging Face and Flower
March 27, 2023 00:00:00
307.
Train your ControlNet with diffusers
March 24, 2023 00:00:00
308.
Jupyter X Hugging Face
March 23, 2023 00:00:00
309.
Multivariate Probabilistic Time Series Forecasting with Informer
March 10, 2023 00:00:00
310.
Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU
March 9, 2023 00:00:00
311.
New ViT and ALIGN Models From Kakao Brain
March 6, 2023 00:00:00
312.
Using Machine Learning to Aid Survivors and Race through Time
March 3, 2023 00:00:00
313.
ControlNet in Diffusers 🧨
March 3, 2023 00:00:00
314.
Ethical guidelines for developing the Diffusers library
March 2, 2023 00:00:00
315.
How Hugging Face Accelerated Development of Witty Works Writing Assistant
March 1, 2023 00:00:00
316.
Red-Teaming Large Language Models
February 24, 2023 00:00:00
317.
Swift Diffusers: Fast Stable Diffusion for Mac
February 24, 2023 00:00:00
318.
Hugging Face and AWS partner to make AI more accessible
February 21, 2023 00:00:00
319.
Zero-shot image-to-text generation with BLIP-2
February 15, 2023 00:00:00
320.
Why we're switching to Hugging Face Inference Endpoints, and maybe you should too
February 15, 2023 00:00:00
321.
🤗 PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware
February 10, 2023 00:00:00
322.
Speech Synthesis, Recognition, and More With SpeechT5
February 8, 2023 00:00:00
323.
Generating Stories: AI for Game Development #5
February 7, 2023 00:00:00
324.
Introducing ⚔️ AI vs. AI ⚔️ a deep reinforcement learning multi-agents competition system
February 7, 2023 00:00:00
325.
Accelerating PyTorch Transformers with Intel Sapphire Rapids, part 2
February 6, 2023 00:00:00
326.
A Dive into Pretraining Strategies for Vision-Language Models
February 3, 2023 00:00:00
327.
The State of Computer Vision at Hugging Face 🤗
January 30, 2023 00:00:00
328.
2D Asset Generation: AI for Game Development #4
January 26, 2023 00:00:00
329.
Using LoRA for Efficient Stable Diffusion Fine-Tuning
January 26, 2023 00:00:00
330.
What Makes a Dialog Agent Useful?
January 24, 2023 00:00:00
331.
Optimum+ONNX Runtime - Easier, Faster training for your Hugging Face models
January 24, 2023 00:00:00
332.
3D Asset Generation: AI for Game Development #3
January 20, 2023 00:00:00
333.
Universal Image Segmentation with Mask2Former and OneFormer
January 19, 2023 00:00:00
334.
Welcome PaddlePaddle to the Hugging Face Hub
January 17, 2023 00:00:00
335.
Image Similarity with Hugging Face Datasets and Transformers
January 16, 2023 00:00:00
336.
AI for Game Development: Creating a Farming Game in 5 Days. Part 2
January 9, 2023 00:00:00
337.
Introduction to Graph Machine Learning
January 3, 2023 00:00:00
338.
AI for Game Development: Creating a Farming Game in 5 Days. Part 1
January 2, 2023 00:00:00
339.
Accelerating PyTorch Transformers with Intel Sapphire Rapids, part 1
January 2, 2023 00:00:00
340.
Zero-shot image segmentation with CLIPSeg
December 21, 2022 00:00:00
341.
Model Cards: Introducing HF Model documentation tools
December 20, 2022 00:00:00
342.
Ethics and Society Newsletter #2: Let's talk about bias!
December 15, 2022 00:00:00
343.
A Complete Guide to Audio Datasets
December 15, 2022 00:00:00
344.
Faster Training and Inference: Habana Gaudi®2 vs Nvidia A100 80GB
December 14, 2022 00:00:00
345.
Illustrating Reinforcement Learning from Human Feedback (RLHF)
December 9, 2022 00:00:00
346.
From GPT2 to Stable Diffusion: Hugging Face arrives to the Elixir community
December 9, 2022 00:00:00
347.
Deep Learning with Proteins
December 2, 2022 00:00:00
348.
Using Stable Diffusion with Core ML on Apple Silicon
December 1, 2022 00:00:00
349.
Probabilistic Time Series Forecasting with 🤗 Transformers
December 1, 2022 00:00:00
350.
VQ Diffusion with 🧨 Diffusers
November 30, 2022 00:00:00
351.
We are hiring interns!
November 29, 2022 00:00:00
352.
Diffusion Models Live Event
November 25, 2022 00:00:00
353.
Accelerating Document AI
November 21, 2022 00:00:00
354.
An Overview of Inference Solutions on Hugging Face
November 21, 2022 00:00:00
355.
Director of Machine Learning Insights [Part 4]
November 23, 2022 00:00:00
356.
Hugging Face Machine Learning Demos on arXiv
November 17, 2022 00:00:00
357.
Sentiment Classification with Fully Homomorphic Encryption using Concrete ML
November 17, 2022 00:00:00
358.
Generating Human-level Text with Contrastive Search in Transformers 🤗
November 8, 2022 00:00:00
359.
Introducing our new pricing
November 8, 2022 00:00:00
360.
Training Stable Diffusion with Dreambooth using 🧨 Diffusers
November 7, 2022 00:00:00
361.
Fine-Tune Whisper with 🤗 Transformers
November 3, 2022 00:00:00
362.
Accelerate your models with 🤗 Optimum Intel and OpenVINO
November 2, 2022 00:00:00
363.
Evaluating Language Model Bias with 🤗 Evaluate
October 24, 2022 00:00:00
364.
From PyTorch DDP to 🤗 Accelerate to 🤗 Trainer, mastery of distributed training with ease
October 21, 2022 00:00:00
365.
MTEB: Massive Text Embedding Benchmark
October 19, 2022 00:00:00
366.
Getting started with Hugging Face Inference Endpoints
October 14, 2022 00:00:00
367.
Stable Diffusion in JAX/Flax
October 13, 2022 00:00:00
368.
Optimization story: Bloom inference
October 12, 2022 00:00:00
369.
Introducing DOI: the Digital Object Identifier to Datasets and Models
October 7, 2022 00:00:00
370.
Japanese Stable Diffusion
October 5, 2022 00:00:00
371.
Very Large Language Models and How to Evaluate Them
October 3, 2022 00:00:00
372.
Image Classification with AutoTrain
September 28, 2022 00:00:00
373.
How 🤗 Accelerate runs very large models thanks to PyTorch
September 27, 2022 00:00:00
374.
SetFit: Efficient Few-Shot Learning Without Prompts
September 26, 2022 00:00:00
375.
Ethics and Society Newsletter #1
September 22, 2022 00:00:00
376.
Incredibly Fast BLOOM Inference with DeepSpeed and Accelerate
September 16, 2022 00:00:00
377.
How to train a Language Model with Megatron-LM
September 7, 2022 00:00:00
378.
What's new in Diffusers? 🎨
September 12, 2022 00:00:00
379.
Train your first Decision Transformer
September 8, 2022 00:00:00
380.
OpenRAIL: Towards open and responsible AI licensing frameworks
August 31, 2022 00:00:00
381.
Visualize proteins on Hugging Face Spaces
August 24, 2022 00:00:00
382.
Stable Diffusion with 🧨 Diffusers
August 22, 2022 00:00:00
383.
Pre-Train BERT with Hugging Face Transformers and Habana Gaudi
August 22, 2022 00:00:00
384.
Deploying 🤗 ViT on Vertex AI
August 19, 2022 00:00:00
385.
Deep Dive: Vision Transformers On Hugging Face Optimum Graphcore
August 18, 2022 00:00:00
386.
A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes
August 17, 2022 00:00:00
387.
Introducing Skops
August 12, 2022 00:00:00
388.
Hugging Face's TensorFlow Philosophy
August 12, 2022 00:00:00
389.
Deploying 🤗 ViT on Kubernetes with TF Serving
August 11, 2022 00:00:00
390.
Train and Fine-Tune Sentence Transformers Models
August 10, 2022 00:00:00
391.
Proximal Policy Optimization (PPO)
August 5, 2022 00:00:00
392.
Introducing the Private Hub: A New Way to Build With Machine Learning
August 3, 2022 00:00:00
393.
Nyströmformer, Approximating self-attention in linear time and memory via the Nyström method
August 2, 2022 00:00:00
394.
AI Policy @🤗: Comments on U.S. National AI Research Resource Interim Report
August 1, 2022 00:00:00
395.
Introducing new audio and vision documentation in 🤗 Datasets
July 28, 2022 00:00:00
396.
Faster Text Generation with TensorFlow and XLA
July 27, 2022 00:00:00
397.
Deploying TensorFlow Vision Models in Hugging Face with TF Serving
July 25, 2022 00:00:00
398.
Advantage Actor Critic (A2C)
July 22, 2022 00:00:00
399.
How to train your model dynamically using adversarial data
July 16, 2022 00:00:00
400.
The Technology Behind BLOOM Training
July 14, 2022 00:00:00
401.
Building a Playlist Generator with Sentence Transformers
July 13, 2022 00:00:00
402.
Introducing The World's Largest Open Multilingual Language Model: BLOOM
July 12, 2022 00:00:00
403.
Getting Started with Sentiment Analysis on Twitter
July 7, 2022 00:00:00
404.
Policy Gradient with PyTorch
June 30, 2022 00:00:00
405.
Liftoff! How to get started with your first ML project 🚀
June 29, 2022 00:00:00
406.
Accelerate Large Model Training using DeepSpeed
June 28, 2022 00:00:00
407.
Announcing Evaluation on the Hub
June 28, 2022 00:00:00
408.
Getting Started With Embeddings
June 23, 2022 00:00:00
409.
Convert Transformers to ONNX with Hugging Face Optimum
June 22, 2022 00:00:00
410.
Intel and Hugging Face Partner to Democratize Machine Learning Hardware Acceleration
June 15, 2022 00:00:00
411.
Director of Machine Learning Insights [Part 3: Finance Edition]
June 14, 2022 00:00:00
412.
The Annotated Diffusion Model
June 7, 2022 00:00:00
413.
Deep Q-Learning with Atari
June 7, 2022 00:00:00
414.
Graphcore and Hugging Face Launch New Lineup of IPU-Ready Transformers
May 26, 2022 00:00:00
415.
Introducing Pull Requests and Discussions 🥳
May 25, 2022 00:00:00
416.
Efficient Table Pre-training without Real Data: An Introduction to TAPEX
May 23, 2022 00:00:00
417.
An Introduction to Q-Learning Part 2
May 20, 2022 00:00:00
418.
How Sempre Health is leveraging the Expert Acceleration Program to accelerate their ML roadmap
May 19, 2022 00:00:00
419.
Putting ethical principles at the core of research lifecycle
May 19, 2022 00:00:00
420.
An Introduction to Q-Learning Part 1
May 18, 2022 00:00:00
421.
Machine Learning Experts - Sasha Luccioni Interview
May 17, 2022 00:00:00
422.
Announcing the Hugging Face Fellowship Program
May 17, 2022 00:00:00
423.
Gradio 3.0 is Out!
May 16, 2022 00:00:00
424.
Director of Machine Learning Insights [Part 2: SaaS Edition]
May 13, 2022 00:00:00
425.
Student Ambassador Program's call for applications is open!
May 13, 2022 00:00:00
426.
Accelerated Inference with Optimum and Transformers Pipelines
May 10, 2022 00:00:00
427.
We Raised $100 Million for Open & Collaborative Machine Learning 🚀
May 9, 2022 00:00:00
428.
Welcome fastai to the Hugging Face Hub
May 6, 2022 00:00:00
429.
An Introduction to Deep Reinforcement Learning
May 4, 2022 00:00:00
430.
Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel
May 2, 2022 00:00:00
431.
Opinion Classification with Kili and HuggingFace AutoTrain
April 28, 2022 00:00:00
432.
Director of Machine Learning Insights [Series]
April 27, 2022 00:00:00
433.
Getting Started with Transformers on Habana Gaudi
April 26, 2022 00:00:00
434.
Introducing Hugging Face for Education
April 25, 2022 00:00:00
435.
Supercharged Customer Service with Machine Learning
April 25, 2022 00:00:00
436.
CO2 Emissions and the 🤗 Hub: Leading the Charge
April 22, 2022 00:00:00
437.
Machine Learning Experts - Lewis Tunstall Interview
April 13, 2022 00:00:00
438.
Habana Labs and Hugging Face Partner to Accelerate Transformer Model Training
April 12, 2022 00:00:00
439.
Don't repeat yourself - 🤗 Transformers Design Philosophy
April 5, 2022 00:00:00
440.
Introducing Decision Transformers on Hugging Face 🤗
March 28, 2022 00:00:00
441.
Machine Learning Experts - Meg Mitchell Interview
March 23, 2022 00:00:00
442.
Announcing the 🤗 AI Research Residency Program
March 22, 2022 00:00:00
443.
Fine-Tune a Semantic Segmentation Model with a Custom Dataset
March 17, 2022 00:00:00
444.
Accelerate BERT inference with Hugging Face Transformers and AWS inferentia
March 16, 2022 00:00:00
445.
Image search with 🤗 datasets
March 16, 2022 00:00:00
446.
Guiding Text Generation with Constrained Beam Search in 🤗 Transformers
March 11, 2022 00:00:00
447.
BERT 101 🤗 State Of The Art NLP Model Explained
March 2, 2022 00:00:00
448.
Fine-Tune ViT for Image Classification with 🤗 Transformers
February 11, 2022 00:00:00
449.
Getting Started with Sentiment Analysis using Python
February 2, 2022 00:00:00
450.
Making automatic speech recognition work on large files with Wav2Vec2 in 🤗 Transformers
February 1, 2022 00:00:00
451.
Supercharged Searching on the Hugging Face Hub
January 25, 2022 00:00:00
452.
Welcome Stable-baselines3 to the Hugging Face Hub 🤗
January 21, 2022 00:00:00
453.
Case Study: Millisecond Latency using Hugging Face Infinity and modern CPUs
January 13, 2022 00:00:00
454.
Boost Wav2Vec2 with n-gram LM in 🤗 Transformers
January 12, 2022 00:00:00
455.
Deploy GPT-J 6B for inference using Hugging Face Transformers and Amazon SageMaker
January 11, 2022 00:00:00
456.
Active Learning with AutoNLP and Prodigy
December 23, 2021 00:00:00
457.
Gradio joins Hugging Face!
December 21, 2021 00:00:00
458.
Perceiver IO: a scalable, fully-attentional model that works on any modality
December 15, 2021 00:00:00
459.
Training CodeParrot 🦜 from Scratch
December 8, 2021 00:00:00
460.
Introducing Snowball Fight ☃️, our First ML-Agents Environment
December 2, 2021 00:00:00
461.
Getting Started with Hugging Face Transformers for IPUs with Optimum
November 30, 2021 00:00:00
462.
Introducing the Data Measurements Tool: an Interactive Tool for Looking at Datasets
November 29, 2021 00:00:00
463.
Accelerating PyTorch distributed fine-tuning with Intel technologies
November 19, 2021 00:00:00
464.
Fine-tuning XLS-R for Multi-Lingual ASR with 🤗 Transformers
November 15, 2021 00:00:00
465.
Scaling up BERT-like model Inference on modern CPU - Part 2
November 4, 2021 00:00:00
466.
Course Launch Community Event
October 26, 2021 00:00:00
467.
Large Language Models: A New Moore's Law?
October 26, 2021 00:00:00
468.
Train a Sentence Embedding Model with 1B Training Pairs
October 25, 2021 00:00:00
469.
The Age of Machine Learning As Code Has Arrived
October 20, 2021 00:00:00
470.
Fine tuning CLIP with Remote Sensing (Satellite) images and captions
October 13, 2021 00:00:00
471.
Hosting your Models and Datasets on Hugging Face Spaces using Streamlit
October 5, 2021 00:00:00
472.
Showcase Your Projects in Spaces using Gradio
October 5, 2021 00:00:00
473.
Summer at Hugging Face ☀️
September 24, 2021 00:00:00
474.
Hugging Face and Graphcore partner for IPU-optimized Transformers
September 14, 2021 00:00:00
475.
Introducing Optimum: The Optimization Toolkit for Transformers at Scale
September 14, 2021 00:00:00
476.
Deep Learning over the Internet: Training Language Models Collaboratively
July 15, 2021 00:00:00
477.
Welcome spaCy to the 🤗 Hub
July 13, 2021 00:00:00
478.
Deploy Hugging Face models easily with Amazon SageMaker
July 8, 2021 00:00:00
479.
Sentence Transformers in the 🤗 Hub
June 28, 2021 00:00:00
480.
Few-shot learning in practice: GPT-NEO and the 🤗 Accelerated Inference API
June 3, 2021 00:00:00
481.
Using & Mixing Hugging Face Models with Gradio 2.0
May 25, 2021 00:00:00
482.
Scaling-up BERT Inference on CPU (Part 1)
April 20, 2021 00:00:00
483.
Introducing 🤗 Accelerate
April 16, 2021 00:00:00
484.
Distributed Training: Train BART/T5 for Summarization using 🤗 Transformers and Amazon SageMaker
April 8, 2021 00:00:00
485.
Understanding BigBird's Block Sparse Attention
March 31, 2021 00:00:00
486.
The Partnership: Amazon SageMaker and Hugging Face
March 23, 2021 00:00:00
487.
My Journey to a serverless transformers pipeline on Google Cloud
March 18, 2021 00:00:00
488.
Fine-Tune Wav2Vec2 for English ASR with 🤗 Transformers
March 12, 2021 00:00:00
489.
Hugging Face Reads, Feb. 2021 - Long-range Transformers
March 9, 2021 00:00:00
490.
Simple considerations for simple people building fancy neural networks
February 25, 2021 00:00:00
491.
Retrieval Augmented Generation with Huggingface Transformers and Ray
February 10, 2021 00:00:00
492.
Hugging Face on PyTorch / XLA TPUs
February 9, 2021 00:00:00
493.
Faster TensorFlow models in Hugging Face Transformers
January 26, 2021 00:00:00
494.
Fit More and Train Faster With ZeRO via DeepSpeed and FairScale
January 19, 2021 00:00:00
495.
How we sped up transformer inference 100x for 🤗 API customers
January 18, 2021 00:00:00
496.
Leveraging Pre-trained Language Model Checkpoints for Encoder-Decoder Models
November 9, 2020 00:00:00
497.
Porting fairseq wmt19 translation system to transformers
November 3, 2020 00:00:00
498.
Hyperparameter Search with Transformers and Ray Tune
November 2, 2020 00:00:00
499.
Transformer-based Encoder-Decoder Models
October 10, 2020 00:00:00
500.
Block Sparse Matrices for Smaller and Faster Language Models
September 10, 2020 00:00:00
501.
The Reformer - Pushing the limits of language modeling
July 3, 2020 00:00:00
502.
How to generate text: using different decoding methods for language generation with Transformers
March 1, 2020 00:00:00
503.
How to train a new language model from scratch using Transformers and Tokenizers
February 14, 2020 00:00:00