Friday, July 22, 2022

Home A.I. Charles H Martin Khareem Sudlow PhD Better than BERT: Pick your best model #AI

Better than BERT: Pick your best model #AI

#A.I.

Have you ever had to sort through HuggingFace to find your best model ? There are over 54,000 models on HuggingFace! So it’s not an easy task.

Most people just choose the most popular model–and this is usually BERT. Or some BERT variant. Bert was created by Google, so it must be good.

But is BERT the really best choice for you ?

How can you find out ? You can search through the literature, read blogs, ask on Reddit, etc, and try to find a better model. This is time consuming and imperfect. Fortunately, there is a better way.

The open-source weightwatcher tool can tell you.

WeightWatcher is an open-source, data-free diagnostic tool that can estimate the quality of an DNN model like BERT, GPT, etc–without needing any data! (No training or test data–just the weights). It has been featured in JMLR, at ICML and KDD, and even in Nature.

Here’s an example using weightwatcher to compare of 3 NLP models: BERT, RoBERTa, and XNLet

The WeightWatcher Power-Law (PL) metric alpha $(\alpha)$ is a DNN model quality metric; smaller is better. This plot above displays all the layer alpha $(\alpha)$ values for the 3 models. It is immediately clear that the XNLet layers look much better than BERT or RoBERTa; the alpha $(\alpha)$ values are smaller on average, and there are no alphas larger than 5: $(\alpha <=5)$ . In contrast, the BERT and RoBERTa alphas are much larger and average, and both models have too many large alphas.

This is totally consistent with the published results.: In the original paper (from Microsoft Research), XLNet outperforms BERT on 20 tasks.

Do it yourself:

WeightWatcher will work with any HuggingFace Transformer (or CV) model.

Here is a Google Colab notebook that lets you reproduce this yourself

Give it a try. And if you need help with AI, ML, or just Data Science, please reach out. I provide strategy consulting, data science leadership, and hands-on, heads-down development. I will have availability in Q3 2022 for new projects. Reach out today. #talkToChuck #theAIguy

via https://AIupNow.com

Charles H Martin, PhD, Khareem Sudlow

Breaking

Glory Days (Instrumental) by BruceDayne

Indian Summer (2020 Encore Mix) by BruceDayne

Housecall Pro Unveils Platform Updates to Support Growth, Efficiency, and Safety for Home Service Professionals #Ecommerce

Wesley Chan on what he looks for as he’s shopping for potential unicorns #Ecommerce

The Biggest B2B eCommerce Trends You Can’t Ignore #Ecommerce

Friday, July 22, 2022

Better than BERT: Pick your best model #AI

Author Details

Fresh Beats Added Daily!

Facebook

Announcing Windows 11 Insider Preview Build 22635.4440 (Beta Channel) #Azure

Announcing the Microsoft Store Awards 2024 winners #Azure

How to prepare for Windows 10 end of support by moving to Windows 11 today #Azure

Announcing Windows 11 Insider Preview Build 26120.2200 (Dev Channel) #Azure

AWS Weekly Roundup: Oracle Database@AWS, Amazon RDS, AWS PrivateLink, Amazon MSK, Amazon EventBridge, Amazon SageMaker and more #AWS

AWS Weekly Roundup: AWS Parallel Computing Service, Amazon EC2 status checks, and more (September 2, 2024) #AWS

Empowering builders with the new AWS Asia Pacific (Malaysia) Region #AWS

Continuous reinvention: A brief history of block storage at AWS #AWS

iFixit's Apple Watch Series 6 teardown discovers larger capacity batteries

The 8th-generation iPad is already $30 off at Walmart

The Apple Watch doesn't come with a power adapter anymore

Apple signs former HBO chief to a five-year deal

Instrumentals to Code to

Virtual Reality

Get the most out of your game with these PC gaming headsets

A fan is attempting to make a Halo: Reach VR mod on PC #VR

Magic Leap reportedly only sold 6,000 AR headsets in six months #VR

Low budget VR set up

Archive

Tags

What is A.I. Up to Now?

Connect with us

Trending

Contact Form

Contact