W!o+'s "Romance of the Least Weasel Workshop" (《小伶鼬工坊演義》): Neural Networks, "Hyper-parameter Evaluation", Part 1




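It is said that when Alonzo Church first used the "λ calculus" to study the problem of "computability", he did not know that it was itself "the world's smallest universal programming language". For "function" and "variable" are "concepts" that anyone must understand clearly in order to write an "algorithm", whichever "programming language" they choose. Its abstraction and compactness are exactly why most people who have read about the "λ calculus" find it both "hard to understand" and "hard to unravel". There is a reason for this. When "abstract things" are discussed by "abstract means", and one does not know why they are formulated that way, they are of course "hard to understand"; and unless one "learns through difficulty and practices through effort", thinking the matter over many times until insight finally arrives, they naturally remain "hard to unravel". Usually, the more "basic" a concept is, the more easily it is "misunderstood", because it seems too "intuitive". Consider the "periodic table" of the chemical elements: it holds no more than 118 elements, yet the world they make up is endlessly varied and complex. A whole thick book might not exhaust the "properties" and "uses" of "iron", yet seen from another angle, isn't iron simply a thing of everyday use?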

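Perhaps it is because rote learning can hardly make a great talent?? And so Mr. Michael Nielsen wrote this passage, with its string of hard questions: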

How to choose a neural network’s hyper-parameters?

Up until now I haven’t explained how I’ve been choosing values for hyper-parameters such as the learning rate, η, the regularization parameter, λ, and so on. I’ve just been supplying values which work pretty well. In practice, when you’re using neural nets to attack a problem, it can be difficult to find good hyper-parameters. Imagine, for example, that we’ve just been introduced to the MNIST problem, and have begun working on it, knowing nothing at all about what hyper-parameters to use. Let’s suppose that by good fortune in our first experiments we choose many of the hyper-parameters in the same way as was done earlier this chapter: 30 hidden neurons, a mini-batch size of 10, training for 30 epochs using the cross-entropy. But we choose a learning rate η=10.0 and regularization parameter λ=1000.0. Here’s what I saw on one such run:

>>> import mnist_loader
>>> training_data, validation_data, test_data = \
... mnist_loader.load_data_wrapper()
>>> import network2
>>> net = network2.Network([784, 30, 10])
>>> net.SGD(training_data, 30, 10, 10.0, lmbda = 1000.0,
... evaluation_data=validation_data, monitor_evaluation_accuracy=True)
Epoch 0 training complete
Accuracy on evaluation data: 1030 / 10000

Epoch 1 training complete
Accuracy on evaluation data: 990 / 10000

Epoch 2 training complete
Accuracy on evaluation data: 1009 / 10000

...
Epoch 27 training complete
Accuracy on evaluation data: 1009 / 10000

Epoch 28 training complete
Accuracy on evaluation data: 983 / 10000

Epoch 29 training complete
Accuracy on evaluation data: 967 / 10000

Our classification accuracies are no better than chance! Our network is acting as a random noise generator!
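As a rough check on the "no better than chance" remark, the sketch below compares the counts printed in the run above with what uniform random guessing would give. It assumes ten equally likely digit classes and independent guesses, which is an approximation and not something the quoted text states; the names `reported`, `chance`, `std`, and `z_scores` are illustrative.

```python
import math

# Correct classifications out of 10000, copied from the run above.
reported = {0: 1030, 1: 990, 2: 1009, 27: 1009, 28: 983, 29: 967}

n_eval = 10000      # size of the evaluation set
p_guess = 1 / 10    # assumption: ten equally likely digit classes

# Mean and standard deviation of a binomial count under pure guessing.
chance = n_eval * p_guess
std = math.sqrt(n_eval * p_guess * (1 - p_guess))

# How many standard deviations each epoch sits from the guessing baseline.
z_scores = {epoch: (correct - chance) / std for epoch, correct in reported.items()}

for epoch, z in z_scores.items():
    print(f"Epoch {epoch}: {reported[epoch]} correct, {z:+.2f} std from chance")
```

Under these assumptions the guessing baseline is 1000 correct with a standard deviation of about 30, and every listed epoch falls within two standard deviations of it.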

“Well, that’s easy to fix,” you might say, “just decrease the learning rate and regularization hyper-parameters”. Unfortunately, you don’t a priori know those are the hyper-parameters you need to adjust. Maybe the real problem is that our 30 hidden neuron network will never work well, no matter how the other hyper-parameters are chosen? Maybe we really need at least 100 hidden neurons? Or 300 hidden neurons? Or multiple hidden layers? Or a different approach to encoding the output? Maybe our network is learning, but we need to train for more epochs? Maybe the mini-batches are too small? Maybe we’d do better switching back to the quadratic cost function? Maybe we need to try a different approach to weight initialization? And so on, on and on and on. It’s easy to feel lost in hyper-parameter space. This can be particularly frustrating if your network is very large, or uses a lot of training data, since you may train for hours or days or weeks, only to get no result. If the situation persists, it damages your confidence. Maybe neural networks are the wrong approach to your problem? Maybe you should quit your job and take up beekeeping?
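To get a feel for how quickly this space grows, the sketch below enumerates a small grid over the choices listed in the passage. The candidate values and the ten-minute cost per training run are illustrative assumptions, not recommendations from the text; only 30, 100, and 300 hidden neurons, η=10.0, λ=1000.0, a mini-batch size of 10, 30 epochs, and the two cost functions come from the passage itself.

```python
import itertools

# Hypothetical candidate values for each hyper-parameter (assumptions).
grid = {
    "hidden_neurons": [30, 100, 300],
    "eta": [0.01, 0.1, 1.0, 10.0],
    "lmbda": [0.1, 1.0, 10.0, 100.0, 1000.0],
    "mini_batch_size": [10, 50, 100],
    "epochs": [30, 60],
    "cost": ["cross-entropy", "quadratic"],
}

# Every combination of one value per hyper-parameter.
configs = [dict(zip(grid, values)) for values in itertools.product(*grid.values())]
n_configs = len(configs)

# Assumed cost of a single training run, in minutes (purely illustrative).
minutes_per_run = 10
hours = n_configs * minutes_per_run / 60

print(f"{n_configs} configurations, about {hours:.0f} hours at "
      f"{minutes_per_run} minutes per run")
```

Even this coarse grid over six hyper-parameters has 720 combinations, so trying all of them blindly is rarely practical; that is one way to read the feeling of being lost described above.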

In this section I explain some heuristics which can be used to set the hyper-parameters in a neural network. The goal is to help you develop a workflow that enables you to do a pretty good job setting hyper-parameters. Of course, I won’t cover everything about hyper-parameter optimization. That’s a huge subject, and it’s not, in any case, a problem that is ever completely solved, nor is there universal agreement amongst practitioners on the right strategies to use. There’s always one more trick you can try to eke out a bit more performance from your network. But the heuristics in this section should get you started.






Could we perhaps take the body's own "meridians" (經絡) as a model to follow??!!

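If there is a "science of dreams" called oneirology, why do the "meridians" still linger outside "mainstream scientific circles" to this day? Could "acupuncture" really have survived for thousands of years on its "uselessness" alone?? Yet the "practical effectiveness" of a "theory" cannot by itself "dispel doubt", just as with "entropy" in "thermodynamics": perhaps its "meaning" only became clear once Boltzmann "defined" it using "statistical mechanics"!! So the fact that, in ancient times long before any "systems theory" existed, the 《黃帝內經》 (Huangdi Neijing) already gave a systematic account of the "five viscera and six bowels" within its "meridian system" is truly "very suspicious"! Seen this way, the 《周公解夢》 (The Duke of Zhou's Interpretation of Dreams) merely interprets "irrational dreams"!!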

Meridians 【經絡】in acupuncture 【針灸】and infrared imaging
Shui-yin Lo

Summary The meridians in acupuncture are hypothesized to be made up of polarized molecules. Quantum excitations, quasi-particles and others are assumed to be the media of communication between different parts of the body connected by meridians. Infrared pictures are taken to depict the effect of acupuncture on one acupoint of a meridian to a far away pain area.

Acupuncture has been around for many thousands of years in China and has achieved good results in both man and animals. It has also recently begun to gain wide acceptance in the West. However, despite many scientific studies, it has still failed to achieve the recognition 【承認】it needs within mainstream orthodox scientific circles. Many studies over the past 40 years have shown that electric conductivity 【導電性】on acupuncture points (1–4) is lower than that on neighboring points. One of the most recent studies has been carried out using functional magnetic resonance imaging (fMRI)【功能性磁振造影】; it has reported the correlation between vision acupoints in the foot and corresponding brain cortices 【皮層】. When acupuncture stimulation is performed on a vision-related acupoint (located on the lateral aspect of the foot), fMRI shows activation of the occipital lobes 【枕葉】. Stimulation of the eye using direct light results in similar activation in the occipital lobes when visualized by fMRI.

Two main questions need to be answered in a modern scientific way:

1. What are meridians?

2. What is the qi 【氣】that is supposed to circulate around the meridians?

The theory behind acupuncture is that the body has a system of meridians which channel 【形成河道】some kind of substance, energy, or information that has been vaguely called qi in the literature. Unfortunately, so far, when one dissects 【解剖】the human body, one does not find any substance that distinguishes the meridians from their surrounding tissues, quite unlike other human systems such as the nerve system or the blood system. Therefore the most likely explanation is that meridians are made up of same ordinary molecules that make up other living materials surrounding them with the exception that they are more ordered. These ordered molecules are neutral but electrically polarized. This provides the natural explanation on the concept of the balance of yin 【陰】and yang 【陽】in Chinese medicine as the neutralization 【中和】of negative and positive charges in electricity. Our hypothesis is then as follows:

The meridians are made up of electrically polarized molecules. On the meridians there are quantum phenomena 【量子現象】such as excitations, quasi-particles 【準粒子】, etc. that account for significant properties of meridians. These polarized molecules line up their polarity to form bigger clusters 【叢集】. Specifically, they are most likely water molecules 【水分子】that group together to form water clusters, which have permanent electric dipole moment. These water clusters then line up together to form the meridians. It has been suggested for a long time that water plays a very active role in the living state of the human body.






