Thoughts on programme title cleaning

The most frustrating thing for me about working with Student Record data has always been the free-text Course Title field. Cleaning it up improves the speed and quality of analysis, which means better decision making.

Across three years of data, there are almost 59,000 different first degree and PGT programme titles, with misspellings, random codes and hundreds of different ways to write “with placement” making powerful programme-level analysis more difficult.

With UniViz, you can receive insightful analysis more quickly as I have made three key additions that improve programme titles:

1) The 𝗨𝗻𝗶𝗩𝗶𝘇 𝗣𝗿𝗼𝗴𝗿𝗮𝗺𝗺𝗲 𝗧𝗶𝘁𝗹𝗲 strips out all of the unnecessary information, leaving only clean, consistent titles. This means programme searching and cleaning take minutes, not days.
2) 𝗔𝘄𝗮𝗿𝗱𝘀 have been removed from programme titles and given their own column in the data. This makes trends more obvious, as there is no need to spend time matching a BSc in one year to a title without an award in others.
3) The 𝗨𝗻𝗶𝗩𝗶𝘇 𝗣𝗿𝗼𝘃𝗶𝗱𝗲𝗿 field improves how satellite campuses and partner institutions are treated. As far as possible, they are now separated from the main provider, so you can benchmark against your peers themselves, not the students they award degrees to in other parts of the country.
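
For anyone curious what this kind of cleaning looks like in practice, here is a minimal Python sketch of the first two steps. It is not the UniViz implementation: the short award list, the regular expressions and the clean_title function are all assumptions made purely for illustration, and real Student Record titles would need far more patterns than this.

```python
import re

# Hypothetical award list, for illustration only; a real list would be much longer.
AWARDS = ["BSc", "BA", "BEng", "MSc", "MA", "MBA"]

# An award at the start of a title, optionally followed by "(Hons)" and "in".
AWARD_RE = re.compile(
    r"^\s*(" + "|".join(AWARDS) + r")\b\s*(?:\(hons\))?\s*(?:in\s+)?",
    re.IGNORECASE,
)

# A few assumed spellings of a trailing placement note, e.g. "(with a placement year)".
PLACEMENT_RE = re.compile(
    r"\s*[\(\-,]?\s*(?:with|inc\.?|including)\s+(?:a\s+)?"
    r"(?:placement|sandwich)(?:\s+year)?\s*\)?\s*$",
    re.IGNORECASE,
)


def clean_title(raw):
    """Return (clean_title, award, has_placement) for one raw course title."""
    title = raw.strip()
    award = None
    match = AWARD_RE.match(title)
    if match:
        # Report the award in its canonical casing and drop it from the title.
        award = next(a for a in AWARDS if a.lower() == match.group(1).lower())
        title = title[match.end():]
    # Remove a trailing placement note and remember that it was there.
    title, removed = PLACEMENT_RE.subn("", title)
    # Collapse any repeated whitespace left behind.
    title = re.sub(r"\s+", " ", title).strip()
    return title, award, removed > 0


print(clean_title("BSc (Hons) Computer Science with Placement"))
```

Returning the award and the placement flag as separate values, rather than discarding them, mirrors the idea of giving awards their own column: nothing is lost, it just stops cluttering the title.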

I now work with almost 𝟰𝟱% 𝗳𝗲𝘄𝗲𝗿 𝘁𝗶𝘁𝗹𝗲𝘀, freeing my time to focus on what actually matters: deep-diving into your programme’s performance and making recommendations to boost its market position.

Clean data isn’t just a nice-to-have; it is vital to the quality of the recommendations we can make from it.

𝗣𝗹𝗲𝗮𝘀𝗲 𝗴𝗲𝘁 𝗶𝗻 𝘁𝗼𝘂𝗰𝗵 𝘁𝗼𝗱𝗮𝘆 𝗶𝗳 𝘆𝗼𝘂 𝗵𝗮𝘃𝗲 𝗮𝗻𝘆 𝗽𝗿𝗼𝗷𝗲𝗰𝘁𝘀 𝘄𝗵𝗶𝗰𝗵 𝗰𝗼𝘂𝗹𝗱 𝗯𝗲𝗻𝗲𝗳𝗶𝘁 𝗳𝗿𝗼𝗺 𝗶𝗻𝘀𝗶𝗴𝗵𝘁𝗳𝘂𝗹 𝗽𝗿𝗼𝗴𝗿𝗮𝗺𝗺𝗲-𝗹𝗲𝘃𝗲𝗹 𝗮𝗻𝗮𝗹𝘆𝘀𝗶𝘀.