Blogs related to Data Science
Industry related
Our Authors
Martin Shine
Sally Wrigley
Sara Boltman
Jacob Mckenzie
James Hume
Lucy Wilson
Patrick Arthur
During this year’s Public Sector AI Week, Sara Boltman, co-founder of Butterfly Data, spoke about the growing role of artificial intelligence in government and public services. Drawing from her background in data science and machine learning, she highlighted AI's potential to improve decision-making while warning of the risks of bias without proper oversight.
Butterfly Data and Oxford Insights developed the Company Analysis Tool (CAT) to identify risks in public contracts using AI. It integrates UK contracting data with ‘red flag’ methodologies to flag suspicious companies.
In this articles we will explore how to use Large Language Models (LLMs) for Semantic Analysis. In recent years, Large Language Models (LLMs) have revolutionised the field of natural language processing (NLP), venturing into the new frontiers...
In data science or data-based projects, the quality of data is one of the most pertinent aspects to be considered. In the case of developing intelligent systems using various AI (Artificial Intelligence) techniques…
Cricket is one of many sports that has undergone a “data revolution” over the last 10 years. This follows in the footsteps of the famous “Moneyball” philosophy which revolutionised baseball in the early 2000s…
In our modern world, data is used almost everywhere to make our lives easier, to sell us things, or even to give elite athletes an edge over their competitors…
One of our directors, Sara, recommended I read this book a long while ago, so I gave it a read for my next blog post. Nate Silver has a background in statistics and making predictions…
Governments around the world are going digital. Considering this, the OECD published a ranking table which rated the UK as 2nd best worldwide…
Two areas of machine learning (ML) have undergone rapid advances over the last few years: Natural Language Processing (NLP) and Image Recognition…
Butterfly Data are proud to have taken part in the Digital Sandbox Sustainability Cohort this year. Successful Companies, selected by the Financial Conduct Authority (FCA) and City of London Corporation (COCL), were challenged to tackle problems in finance…
Data anonymisation is the process of protecting sensitive or private information by erasing or encrypting identifying information about an individual in data. Some of the data this can relate to might include names or locations. Subjective information such as…
The credit union consolidation rate has held steady at about 3.5% a year for the past 40 years. The most common merger scenario has larger entities acquiring small and midsize credit unions. Analysts are speculating that pressures on the biggest credit unions will force many to consider merger strategies in the months to come.
In the first eight months of 2021, publicly announced mergers and acquisitions (M&A) were valued at more than $3.6 trillion globally and $1.8 trillion in the US according to Dealogic. Both numbers are the highest since 1995, when the data provider began its tracking.
Fuzzy Matching, also known as Approximate String Matching, is a technique to identify whether two strings are similar but not the same. Some everyday uses of fuzzy matching include: spell check, auto-correct, spam filtering, record linkage, and address matching.
For my final year University project, I was tasked with using Artificial Intelligence to solve a real-world problem. The misinformation I was seeing every day sprawled over social media about such things as the American election and, more recently, the Covid-19 pandemic, sprang to mind…
Here at Butterfly, we love to celebrate our team and particularly today we want to recognise the achievements and unique entrance to our team of James Lancashire
At the start of this year, Kane Williamson scored 238 in New Zealand’s first innings of their second Test match against Pakistan in Christchurch. Remarkably, this was the first time…