I did some reading RE: Charpy test. Since it's qualitative, comparing one piece of steel against another, if you used the same pendulum, same sample size, and same measuring standards, it seems like a person could actually rig up a crude tester that would provide qualitative comparisons. I assume there would be several variables within the heat treating process, including (but not limited to): hold temperature, hold time, temper temperature and time, and quench medium.
That said, so far I've been using a process similar to mstephen. I don't have great temperature control, so my results have been somewhat spotty. Even so, I do the best I can with what I've got. I start with the usual "nonmagnetic, quench in oil" using heated vet grade mineral oil. I start with a temper at 400, then test the edge. If it chips, I bump up to 425 for the next cycle and test again. So far I've found out several things about the two major kinds of mystery steel I have been using. One, the trap springs I have used do get hard, but 400 is too high to temper, as they seem to lose their edge holding ability if tempered much higher than that. Two, nicholson files will chip on the edge every time if you temper at 400, but usually not at 425. I hate for the ultimate answer to be "spend hundreds or thousands of dollars on equipment," but at the same time, all of this education (thanks Kevin et al, sincerely) is making me less and less satisfied with the tools I have at my disposal. I suppose if at the end of the day I can make a knife that serves my purposes, and can repeat the performance again the next time, I've made some kind of progress, considering that when I started, there were no guarantees that the knife would perform as I wanted, or that I could do it again the same way.
I do know that I certainly have a scientific mind on a backwoods budget when it comes to this knifemaking stuff.