Nick's Path to AI enlightenment

Soldato
Joined
13 Jan 2003
Posts
24,258
So I thought I'd put together a thread around my move towards a better professional understanding of AI. This is a complex and very varied landscape but I hope some of this provide inspiration to anyone else in the same unfortunate position with difficult job market.

So for a first post I'll split this into two - business and the technology. I'll be approaching both.

1. Business
So I went from engineering to presales, then to product with new market and existing products, startups to being accountable for 70+ as part of a very large 3y/2.7Bn programme and the last role basically building out cloud for a multi-national conglomerate at a family level of products.. so my interest here is not "what is AI" but more the business, including the obvious financial approaches, operational models and all the normal fun IT stuff. How do you estimate a AI development using models for example? Cost Structures. Best practices etc.
I have a fairly good idea by cross referencing my experience, the technical business management/finops experience for the cloud work etc with observations but I wanted to get a structured approach, have industry feedback and also get a badge on the CV that says growth mindset.
Approach here is to enroll and pay for a "certified" 6 week course from a recognised university. I don't get a degree but I do get a confirmation of attendance and a mark at the end of it from the assignments. I will also have some live lectures and some group working.

Obviously I can't provide the course, but I will confirm my observations.

We've already seen an ecosystem of AI.. AI as a service etc. Simply put, before the course has stated I'm already aware of:
a) Large company vendors generate a general model (lots of CPU time) based on industry available data. For example the models that AWS provide through Marketplace for Bedrock.
b) Smaller vendors then often produce specialisations of the models or additional models that integrate using a more specialised set of data (that may or may not be specific for your company).
c) Your company then has to expend more time and cost (let's call this development) with specialist skills to train/characterise and confirm acceptance of the resulting model integrations (more development).
d) Competition and data privacy laws and regulations then become part of the morality frameworks, along with the purpose and exposure of individual's data or affected.

I start on the 20th, so this will be and interesting course.. I finish late January.


2. Technology itself
I have a BSc Software Engineering (parallel and distributed computation) so I have some maths, I have A-level maths in the distant past and some maths used for performing image processing - specifically FFT, phase correlation (dot product) and 2/3D maths around spread function distributions in terms of probability. I also spent some time around PHD/Professor maths/physics for quantum computing and photonic cryptography - in short I can appreciate things like error, distributions etc but have also played a bit with randomness and Markov chains but not in any structured fashion. I've basically built maths for implementations from scratch based on scientific papers before. Including finite response and I've also (at school) built a backwards propagation neural network from a Scientific American article back in the early 1980s/1990s.
So with that - basically most devs that have done some maths should be able to follow along.

Approach here is:
* learn the Python programming language, this is going to be painful as it's not a language I like. I preferred R to be honest..
* learn the maths required - specifically calculus, linear algebra and probability theory.
* Lean some ML dev stack - Jupyter notepad, panda, bumpy, and maths plotting. Basically libraries and tools at a very basic level.
* Start with ML courses:
** Stanford Online - ML specialisation
** Andrej Karipathy - Neural Networks
** Stanford Online - Deep Learning specialisation

I'll them probably play with some Kaggle challenges.

Maths - I have some maths school books that specifically deal with calculus, linear algebra and probability theory - that should be a good start.

The rest is simply a case of building myself a virtual machine, installing linux etc and then hacking away as I have done before as the founder of a quantum hackathon team. Thankfully I had some team members with really good maths skills that previously worked in quantum.

So if you feel that you want to embrace the AI change in the workplace, and put a step in that direction. This thread may be a good way.
 
So on Calculus it depends a little on your level of maths but:
Recap:
. this is basic limits, derivatives and integrations but it doesn't tackle continuous functions.

I also have the following books
* Cambridge University Press - AS & A-level maths: Pure Mathematics 2 & 3 Coursebook. This covers algebra, differentiation, integration etc
* Cambridge University Press - AS & A-level maths: Probability & Statistics 1 Coursebook. This covers data, variation, probability, perms & combs, distributions

The idea here is to have a basic knowledge and vocab for communicating and understanding data scientists etc.

One of the best channels and learning is 3blue1brown - I can't explain how good this channel is (it can go really really advanced but this is is simple):

Update - I've gone through the 3B1B Calculus and derivatives and just starting the integration video. Tomorrow should see the end of that. Then I'll start on the probability etc. There's a lot of videos and they're really good as recap/slower pace than the books but I will also continue with the books.
 
Last edited:
Just some suggestions you or others reading the thread might find useful re: these topics:

Approach here is:
* learn the Python programming language, this is going to be painful as it's not a language I like. I preferred R to be honest..

If you like R then check out:

There is also a free book - it covers a lot of classic ML/statistical learning. And you can try the full-fat "ESL" book too, also available for free online. The basic ISL book has practical code examples (in R), the ESL book is mostly theory.

Then for comparison you could try the same stuff but with Python:


That then covers classic ML basics and might be useful as comparison/transition from R to Python.

* learn the maths required - specifically calculus, linear algebra and probability theory.

Looks like your background is fine, you've got calculus covered with 3Blue1Brown, check out MIT OpenCourseWare for more mathematics - in particular, see Gilbert Strang's lectures for Linear Algebra - he's basically the GOAT for teaching that subject.

You're perhaps missing some optimization too, check out Stephen Boyd's courses here:

Lastly, in addition to getting the basics of Linear Algebra, you'll want the numerical/computational stuff too - this is a great book:


and here is a free online course that makes use of it too:

* Lean some ML dev stack - Jupyter notepad, panda, bumpy, and maths plotting. Basically libraries and tools at a very basic level.
* Start with ML courses:
** Stanford Online - ML specialisation
** Andrej Karipathy - Neural Networks
** Stanford Online - Deep Learning specialisation

I think autocorrect has done you dirty there! Pandas will be very straightforward for you to pick up if you're familiar with R, NumPy is quite like Matlab.

Check out this book for some good coverage of pandas (and some NumPy), again a great book and kindly provided free online. (I tend to buy physical copies too if I'm going to spend any length of time with a book).


Also just in general, I linked to an old fast.ai course above, fast.ai stuff is great, you should check out their current two AI/ML courses and their book (which is also free online with associated notebooks).

Installing Anaconda is something you might find useful if you want to play around with jupyter locally and play around with/break things in dedicated environments without messing up other projects. For a zero-faffing-about option though try: https://colab.google/ it's based on jupyter notebooks and you can try a load of basic stuff completely free if you only require a CPU + it's easy enough to access far more powerful GPUs with credits. You can easily run through all the fast.ai examples in colab without having to faff around installing stuff and setting up environments.

some more books:

This is another very practical book, by a former Google Brain guy, it has a nice coverage of classic ML before getting into Deep Learning.

This is the go-to deep learning book for an overview of the theory side of things, also has initial coverage of classic ML, also free online, it's pretty accessible with some basic undergrad maths:


Also if of interest - this is the go-to RL book:


Also if you want some reference books with broad coverage of ML then check out the books by Bishop and Murphy.
 
What's the end result you're expecting of this? From my understanding on the career thread you have held some pretty senior / strategic positions... you seem to going pretty deep here into technology not the outcomes it drives?
 
What's the end result you're expecting of this? From my understanding on the career thread you have held some pretty senior / strategic positions... you seem to going pretty deep here into technology not the outcomes it drives?

Correct - senior focus. It has a couple of modules on AI, deep learning but also covers ethics and AI in organisations.

Learning deeper, helps understand more. I don't need to the nth degree but I'd like to develop something from principles thus know how that maps.
 
Last edited:
What's the end result you're expecting of this? From my understanding on the career thread you have held some pretty senior / strategic positions... you seem to going pretty deep here into technology not the outcomes it drives?

There's stuff out there that's aimed at giving more of an overview but isn't wishy-washy, still delivered by credible institutions etc. This sort of thing sort of sits in between MOOCs and full-fat university courses. While these short courses don't count for academic credit they do count towards CPD hours, useful for people who are members of professional associations etc.

They're also often priced at a level that's easy to justify in training budgets or perhaps well within the already allocated training allowance some employees may have.

Overview of AI for managers @ Oxford:

Data-driven decision-making @ Cambridge:

AI Ethics (this again will be important for managers) @ LSE (I think Oxford has a short AI Ethics course too)

Maybe not worth it if you're self funding but if you've got a training budget and your employer is paying then these are very current/relevant things to justify asking for.

(Also just on the subject in general - LSE offers an "MBA Essentials" short course (again no credit but does count for CPD hours) and each year Santander sponsors a load of free places if you apply via their academy, I don't think you actually even need to be a customer of the bank. Not something I'd pay for myself but if it's free then...)
 
Last edited:
An interesting podcast with the Chief Data Officer of BAE Systems - this has some great organisational observations (not surprisingly) but also highlights the need to focus on the problem and the where people cost has given way to energy costs (and reliance on power and networking that could/will be under threat as we move towards a war with russia).

 
Well .. you can't escape doing ML optical character recognition as the ML "hello world"..

jZolguk.png
 
Last edited:
So I've been getting back into the course over the last two days (although I have the lurgy). Rather than battle on this afternoon I decided on a simpler task to understand some of benefits and disadvantages of coding with ChatGPT models.

The task - create a Xcode macOS application to display a configurable guitar fretboard.

ChatGPT4 seemed to understand the prompt interactions better but still managed to give code that failed to check index limits resulting in a segmentation violation unless explicitly told not to.

The ChatGPT earlier model came up with a completely different code project, the code is completely different and the project itself is define differently - using the same text for prompts.

But it's been interesting - will continue tomorrow when the free credits reset.
 
Can ChatGPT code?

tEeAnqc.png


TL;DR - it would not replace a graduate joiner at the moment.

I chose a fiendishly simple application - tune a guitar through the audio coming from a USB interface on the Apple Mac. This has some intesting gotchas that are obvious but not obvious:
* application operating system interactions need authorisation - in this case the audio needs authorising by the user
* audio interactions using USB require higher QoS thread scheduling to prevent hangs etc, especially with 24bit 192kHz sample rate
* audio processing by default should use fast libraries to perform analysis (my target here in my head is the vDSP library to provide the maths for FFT)
* detecting the fundamental frequency of the guitar string in the noise of of the audio spectrum that the pick up hears - this includes both standing string wave, harmonics and the travelling wave+harmonics of the string pick that decays to leave the standing wave.
* multiple strings and frequency variations means there's not going to be a perfect frequency match so typically this would be "what accuracy would you consider being close to note X?".
* accuracy needs to be in the 0.01Hz region to detect 'cent' (ie detect each fret space divided into 1200 steps) to 0.1 cent.
* I indeed asked that it should use a notch filter to improve the detection, but also detect non-notched (ie inverse) for detecting fret buzz where strings are too close to the frets and rattle the strings.

So here's some experiences just using ChatGPT to code:

1. It didn't validate configurable parameters - so adding 1 string resulted in a core dump crash. I had to specifically tell it that the application should not crash due to unchecked array index values.
2. You have to be very particular in what you are telling it.. and it often talks about doing stuff but doesn't. It decided not to bother coding the notch filters or calculate the quality based on noise based on non-notched frequency, instead it decided to give me the power of the frequency. Even though it talked about notching - that doesn't exist, nor does notched-based quality measurements.
3. I had to tell it which frameworks and tools to use, it stops and says it needs something there but needs interactions to define the path it takes (ie use vDSP for FFT).
4. It wrote unsafe pointer code.. again this is crash material.
5. It wrote code that has concurrency issues - it attempts to run the audio code on the main GUI thread.. so that's has serious consequences in hangs etc.
6. It needed me to sort out the nuances of audio authorisation - both declaring the app uses the audio input and to add code to seek the users permission.
7. The resulting detection of notes is .. well.. lacking. It assumes the code it writes is correct and works. It doesn't create tests to demonstrate the code is correct and complies to the requirements given.

I'll give it a couple of years to get good.. because at the moment it's not going to get the coding job. It requires lots of knowledge of the coding, of the requirements, and of development processes to be a viable 'coder', even to suggest coding examples without due-diligence.

Still - it was fun.
 
It’s only good for basic boilerplate and some rubber duck conversations.

I often find that CoPilot creates methods that have never existed or were deprecated years ago.
 
What will be interesting is if the code languages themselves end up being generated by AI - specially to remove some of the 'human' language syntax restrictions.

I've heard that about CoPilot and GitHub is that ChatGPT is better and that's saying something!
 
What will be interesting is if the code languages themselves end up being generated by AI - specially to remove some of the 'human' language syntax restrictions.

I've heard that about CoPilot and GitHub is that ChatGPT is better and that's saying something!
CoPilot is "ChatGPT", with the default using 4o but you can pick o1 and o1-mini as additional options, however, they are opening it up to more models, Claude 3.5 being one.
 
CoPilot is "ChatGPT", with the default using 4o but you can pick o1 and o1-mini as additional options, however, they are opening it up to more models, Claude 3.5 being one.

My experience so far is ChatGPT directly, I haven't used Claude (requires mobile number even with google login!) nor CoPilot.

When I have more time I'll look further afield. I did look at the google AI ecosystem architecture too but for now it's keeping it simple.

I have noted that people use "ChatGPT" even when the model is not.
 
Last edited:
So I'm now 3/6 modules completed with decent marks so far. It's also occurring to me I've learnt more as part of this course than I would if I'd simply hacked away using the more detailed Google AI training (and I've done the parallel neural network course) as you're forced to think through examples and describe the thinking/approaches etc.

It's been interesting hearing about AI changes that are rattling around in the UK legal sector. How healthcare in the US classified AI as a medical device for example and that gets used as a US regulatory reference at the moment.

A discussion on safety was an interesting one. Plenty to think about but most people think AI vs humans, rather than competitive AIs competing against each other AND humans. It's also starting to dawn on people that if you replace workers then the government has to charge more in corporation tax to support the missing taxation revenue.

Can AI replace humans - undoubtedly so. Will AI replace humans? Money and resources but mainly planetary resources will be an issue.

Now if AI deems humans as legacy, heads for the stars and leaves it's mess behind for humans to cleanup, that would not surprise me.
 
I’m now deep into Ethics of AI as part of the 5th module, currently upto my eyeballs going through privacy law (wider than GDPR).

The modules 1-3 have been quite technically deep but also wide. Talking to some fellow students at a meetup evening, it’s clear that everyone is saying “7-10 hours/week” is more like 21-30 hours in reality.
The modules 4 & 5 have moved away from technical (but you still need that nuanced understanding) toward organisational use, ethics and finally business cases etc.

i think M4 may be a weak essay from me - I had cold and didn’t want to take an extension or I’d be behind. Will find out next week.

I oddly feel I seem to be approaching subjects in the essays thay the next module is about.
 
Starting to integrate more AI into my work flows. Definitely a big productivity boost when you're picking up a new language, and doing some repetitive stuff.

Writting some fusion nodes for Da Vinci Resolve, first time I've used Lua in a while, Copilot has been very useful.
 
Back
Top Bottom