February 2006 Archives
2006-02-28 22:04:09
Note on Lab Meeting 20060228
- Title : Linear and Order Statistics Combiners for Pattern Classification
Source : Combining Artificial Neural Networks (Amanda Sharkey, editor)
Author : Kagan Tumer and Joydeep Ghosh
Speaker : ÄÁ¤¹ª@
Time: 2.28 (Tuesday) 13:00-15:00
Those curves are posterior probabilities, i.e., for a given x, the probability of classifier classify it into class i(or j).
Shaded region is Bayes error region; if we know the distributions of two posterior probabilities, then we can determine the optimum boundary; therefore, the Bayes error region is essential. The obtained distribution of posterior probability always include error. That is, we choose a Obtained Boundary according to the obtained distributions of posterior probabilities. And the black region is "Added error region".
Using Linear combining
If the error(bias and noise) of different classifiers are i.i.d., then we use linear approximation to estimate error region, and linear combining those. We will got the average error:
$E^{avg}_{add}=\frac{E_{add}}{N}$
i.e., average error equals 1/N of each error.
If bias is not i.i.d., then
$E^{avg}_{add}=E_{add}(\frac{1+\delta(N-1)}{N})$
Order Statistics
Similar.
2006-02-26 23:12:53
Course of NCTU, Quantum Computation, Spring 2006
Syllabus:
1. Introduction to quantum mechanics
2. Introduction to Computation theory
3. Quantum circuits
4. Quantum Fourier transform and its applications
5. Finding hidden subgroup
6. Quantum search
7. Quantum Integer factorization
8. Quantum Circuit lower bounds
Refewnce:
1. Quantum Computation and Quantum Information, Michael Nielsen and Isaac
Chuang, Cambridge University Press, 2000.
2. Handouts.
Time and Location:
1. 18:30~21:20, Tuesday. Room 305, ¤uµ{¥|À], NCTU.
2. 18:30~21:20, Thursday. Room 302, ¤uµ{¥|À], NCTU.
2006-02-23 16:44:28
Research Method
I had read a book, "The Craft of Research," by Wayne Booth, Gregory G. Colomb, Joseph M. Williams. This well-known book seems for students of social sicence. Although there are some advices useful, this book lacks the research method of science.
From some uselful articles on internet such as How to Succeed in Postgraduate Study, by Marie desJardins, I got the informations about research method book of sicence. The following are some books.
-
Writing the Laboratory Notebook, by Howard M. Kanare (I checked out this book from NCU.)
An Introduction to Scientific Research by E. Bright Wilson (NTHU)
Writing Successful Science Proposals, by Andrew J. Friedland, Carol L Folt (NCTU)
Communicating in Science : Writing a Scientific Paper and Speaking at Scientific Meetings, by Vernon Booth (NTHU)
The Craft of Scientific Presentations : Critical Steps to Succeed and Critical Errors to Avoid, by Michael Alley (NTHU)
The Craft of Scientific Writing, by Michael Alley (NCTU)
At The Helm: A Laboratory Navigator, by Kathy Barker (NTHU)
Lab Dynamics: Management Skills for Scientists, by Carl M. Cohen, SUZANNE L. COHEN
2006-02-20 22:37:31
Some interesting courses on OCW
18.433 Combinatorial Optimization, Fall 2003
18.314 Combinatorial Analysis, Fall 2005
18.312 Algebraic Combinatorics, Spring 2005
18.997 Topics in Combinatorial Optimization, Spring 2004
18.435J / 2.111J / ESD.79J Quantum Computation, Fall 2003
6.854J / 18.415J Advanced Algorithms, Fall 1999
6.854J / 18.415J Advanced Algorithms, Fall 2001
18.409 Behavior of Algorithms, Spring 2002
18.404J / 6.840J Theory of Computation, Fall 2002
18.315 Combinatorial Theory: Hyperplane Arrangements, Fall 2004
6.050J / 2.110J Information and Entropy, Spring 2003
6.231 Dynamic Programming and Stochastic Control, Fall 2002
6.854J / 18.415J Advanced Algorithms, Fall 2001
6.243J / 2.156J / 16.337J Dynamics of Nonlinear Systems, Fall 2003
2006-02-20 00:01:47
Theory of cellular automata: A survey
In fact, I sent a mail with seven papers to ask prof. Tang to choose one. It is lucky that prof. Tang told me that each one is good as material for presenting at Lab meeting. The seven papers which I chose can be classified into three classes, complex science, complex graph (random graph), and combinatorial algorithms. Why did I choose them? The reason is that I want to study theory and it can connect to the research of our Lab. Since biological systems are one kind of complex systems, I think theory about complex science, such as CA and scalar-free graph, will be good choices.
The question which reside in my heart long time is the topics of my thesis.
I recall that my senior, Dr. Chen, said "Master degree of theory study is meaningless. Two years are too short to learn enough to do real research." Personally, I think the master program as a preparation of Ph.D. But it really made me to think the future seriously.
Let's come back to the topic of thesis. Actually, the topic doesn't matter. What I ask repeatedly is the direction I really want to spend my life on. I observed myself in many aspects; I seem not a programmer, not a combinatorialist, not a engineer. Combinatorics is interesting, programming too. But I can't keep my interests on one and one problem. I more like to think some intrinsic problems.
The first time I like math is because of combinatorics. But I still want to join some Labs about OS or VLSI design. At the final year of my undergraduate days, I attend the Algebra I,II of dept. of math. That is first time I think the math greater than man's mind.
I thought that math is developed by a group of geniuses for fun or for solving real problems. After I learned algebra, I thought that math is with intrinsic great properties. It must come form THE BOOK, like physics. I want to present this paper because I want to connect the real world and math.
2006-02-18 00:48:56
Computer Science and Its Relation to Mathematics
The most concise and funny story I ever heard to explain the relationship between CS and Math is in above article written by Knuth.
2006-02-17 18:58:52
Gian-Carlo Rota
Overview and history
Modern combinatorics began to develop in the late nineteenth century and became a distinguishable field of study in the twentieth century, partly through the publication of the systematic enumerative treatise Combinatory Analysis by Percy Alexander MacMahon in 1915 and the work of R.A. Fisher in design of experiments in the 1920s. Two of the most prominent combinatorialists of recent times were the prolific problem-raiser and problem-solver Paul Erd?s, who worked mainly on extremal questions, and Gian-Carlo Rota, who helped to formalize the subject beginning in the 1960s, mostly in enumeration and algebraization.
Paul Erdos is really a famous mathematician, but Gian-Carlo Rota seems not. According to the description from his entry on Wikipedia, he did great work on combinatoricsHis series of ten papers on "Foundations of Combinatorics" in the 1960s is credited with making it a respectable branch of modern mathematics. He said that the one combinatorial idea he would like to be remembered for is the correspondence between combinatorial problems and problems of the location of the zeroes of polynomials.[1] He worked on the theory of incidence algebras (which generalize the 19th-century theory of M Ébius inversion) and popularized their study among combinatorialists, set the umbral calculus on a rigorous foundation, unified the theory of Sheffer sequences and polynomial sequences of binomial type, and worked on fundamental problems in probability theory.
Why does he be ignored?2006-02-17 16:12:29
Articles about Academia From The Becker-Posner Blog
Retirement:
Comment on Retirement--BECKER
Refusing to Retire: What Can Be Done When People Overstay Their Welcome?
Tenure:
Comment on Tenure-BECKER
Tenured Employment--Posner
Tenure--Posner's Reply to Comments
Response on Tenure-BECKER
For-Profit Colleges:
For-Profit Colleges and Universities--Posner's Comment
For-Profit Colleges-BECKER
On For-Profit Colleges Again-BECKER
2006-02-16 03:46:48
How to Succeed in Postgraduate Study
The Daily Grind:
Keeping a journal of your research activities and ideas is very useful. Write down speculations, interesting problems, possible solutions, random ideas, references to look up, notes on papers you've read, outlines of papers to write, and interesting quotes. Read back through it periodically. You'll notice that the bits of random thoughts start to come together and form a pattern, often turning into a research project or even a thesis topic. I was surprised, looking back through my journal as I was finishing up my thesis, how early and often similar ideas had cropped up in my thinking, and how they gradually evolved into a dissertation.
Keep the papers you read filed away so you can find them again later, and set up an online bibliography (BibTeX is a popular format, but anything consistent will do). I find it useful to add extra fields for keywords, the location of the paper (if you borrowed the reference from the library or a friend), and a short summary of particularly interesting papers. This bibliography will be useful for later reference, for writing your dissertation, and for sharing with other postgraduate students (and eventually, perhaps, supervising).
Those advices are well-known, but not everyone could do it well. As I ever mentioned, Dr. Shalizi's website is great, I don't know that why he has so much time to write down such articles. He must have good habits and techniques of gathering and managing informations. I want to follow those advices but I always hastily put papers on desktop when I am surveying.Finding a Thesis Topic:
A good source of ideas for honours and master's projects (and sometimes for PhD topics) is the future work section of papers you're interested in. Try developing and implementing an extension to an existing system or technique.
In order to do original research, you must be aware of ongoing research in your field. Most students spend up to a year reading and studying current research to identify important open problems. However, you'll never be able to read everything that might be relevant -- and new work is always being published.
2006-02-16 03:45:20
Quotations From Michael Faraday
``The five essential entrepreneurial skills for success are concentration, discrimination, organization, innovation and communication.''
``Why, sir, there is every possibility that you will soon be able to tax it! (to PM William Gladstone, on the usefulness of electricity)''
In fact, I looked for the one of my favorite quotations from Faraday, which I read it from a chinese biography of Faraday. But I can't find that...:(2006-02-15 10:30:32
Habits of Working
2006-02-15 08:09:20
Winners of math medal study combinatorics.
Fields Medal
-
William T. Gowers (Cambridge Univ.), for his work in functional analysis and combinatorics
Gregori Aleksandrovich Margulis Provided innovative analysis of the structure of Lie groups. His work belongs to combinatorics, differential geometry, ergodic theory, dynamical systems, and Lie groups.
Jean Bourgain His work is in various areas of mathematical analysis such as the geometry of Banach spaces, harmonic analysis, analytic number theory, combinatorics, ergodic theory and partial differential equations.
-
Hassler Whitney for his fundamental work in algebraic topology, differential geometry and differential topology.
PAUL ERDOS, Hungarian Academy of Sciences, Budapest, Hungary, for his numerous contributions to number theory, combinatorics, probability, set theory and mathematical analysis, and for personally stimulating mathematicians the world over.
LASZLO LOVASZ, Yale University, New Haven, Connecticut, U.S.A., and Eotvos University, Budapest, Hungary, for his outstanding contributions to combinatorics, theoretical computer science and combinatorial optimization.
GREGORY A. MARGULIS, Yale University, New Haven, Connecticut, USA, for his monumental contributions to algebra, in particular to the theory of lattices in semi-simple Lie groups, and striking applications of this to ergodic theory, representation theory, number theory, combinatorics, and measure theory
2006-02-13 06:44:50
Complex Systems or Complex Science
I love this book, the idea of complexity is amazing. And The great part is this story is unfinished (at least, in the book.) Unlike other books about subjects of science, the author said : "This is a book about the science of complexity-a subject that's still so new and so wide-ranging that nobody knows quite how to define it, or even where its boundaries lie." Books about other subjects of science usually told readers the stories of its glory days and history, such as physics, quantum theory. That make me thought I might study it when I am a scientist.
That's one of reasons I overtook the paper "Methods and Techniques of Complex Systems Science: An Overview " to read.
But another question raised. Why don't I hear many discussion on it?
In the formal education, I hardly heard people discuss about complexity in Taiwan. There are two possible answers, one is that the level of science research in Taiwan is behind the top level of world very much, the other is that the achievement of complex science is not good as expecting.
How do other scientists think about complexity science?
Science Magazine, April 1999, Vol 284, Special Issue "Complex Systems" might be a good start point.
Btw, Cosma Rohilla Shalizi's Website is great!
2006-02-12 15:33:37
Methods and Techniques of Complex Systems Science: An Overview (Part II)
Ref. Methods and Techniques of Complex Systems Science: An Overview (Part I)
My original thought is to note what I read in this paper, but it is difficult to write down without misunderstanding.
Section II B. Choice of Architecture
The basic idea of data mining is to fit a model with minimal assumptions about what the correct model should be, or how the variables in the data are related.
This subsection seems about Non-parametric statistics, but some terms, such as kernel method, casual inference, in this subsection seem to be topics of pattern recognition or AI.Section III Time series analysis
-
The old school of time series analysis:
They view the time series as samples from a stochastic process, and applies a mixture of traditional statistical tools ans assumptions(linear regression, the properties of Gaussian distributions) and the analysis of the Fourier spectrum.
The new school of time series analysis:They view time series as distorted or noisy measurements of an underling dynamical system, which it aims to reconstruct.)
I can't differ agent-based model and CA, clearly. They both look like variants of automata....
The Complexity Measures part are interesting. But the name "algorithmic complexity" is weird. As I know the Kolmogorov complexity should be called descriptive complexity.
Further Reading:
-
General: Shalizi CR, Crutchfield JP, "Computational Mechanics: Pattern and Prediction, Structure and Simplicity. Author said it is one of best available survey about the complexity (of complex science) and required grad. level statistical mechanics.
Data mining and Statistical Learning: This field seems very very important in complex science. The author Dr. Shalizi spent half of pages to introduce this fields, and it seems really a powerful tools. But I am not interested in it currently.
Time series: Same as above. The part of symbolic dynamics seems more interesting, but it seems very difficult.
Cellular Automata: This field is what I am interested in at first. But I am more interested in the automata-theoretic aspect not modeling. Cellular Automata : A Discrete Universe, by Ilachinski, Andrew., Cellular Automata: Theory and Experiment by Howard Gutowitz (Editor), Cellular Automata and Complexity by Stephen Wolfram, New Constructions in Cellular Automata by David Griffeath, Cristopher Moore.
Agent-Based Modeling: Self-Organization in Biological Systems by Scott Camazine, Jean-Louis Deneubourg, Nigel R. Franks, James Sneyd, Guy Theraula, Eric Bonabeau, The Complexity of Cooperation: Agent-Based Models of Competition and Collaboration by Robert Axelrod. I just want to check the differences between CA and ABM.
Information Theory: The relationship between physics and information seem funny. Complexity, Entropy and the Physics of Information: The Proceedings of the 1988 Workshop on Complexity, Entropy, and the Physics of Information Held by Wojciech Hubert Zurek, Crystalline Computation, Norman Margolus.
Complexity Measure: I don't know its usage, but it seems funny topics.
2006-02-12 10:14:38
List of publications in computer science
However, there is a great article about computer science of Wikipedia. List of publications in computer science
This is a list of important publications in computer science, organized by field.
2006-02-11 01:06:03
Update Nanoblogger
2006-02-11 00:17:22
Quotation from John von Neumann
The sciences do not try to explain, they hardly even try to interpret, they mainly make models. By a model is meant a mathematical construct which, with the addition of certain verbal interpretations describes observed phenomena. The justification of such a mathematical construct is solely and precisely that it is expected to work.
2006-02-10 02:41:19
The sole end of science is the honor of the human mind
Carl Gustav Jakob Jacobi
It is often more convenient to possess the ashes of great men than to possess the men themselves during their lifetime.
[Commenting on the return of Descartes' remains to France]
In H. Eves Mathematical Circles Adieu, Boston: Prindle, Weber and Schmidt, 1977.
The real end of science is the honor of the human mind.
In H. Eves In Mathematical Circles, Boston: Prindle, Weber and Schmidt, 1969.
It is true that Fourier had the opinion that the principal aim of mathematics was public utility and explanation of natural phenomena; but a philosopher like him should have known that the sole end of science is the honor of the human mind, and that under this title a question about numbers is worth as much as a question about the system of the world.
In N. Rose Mathematical Maxims and Minims, Raleigh NC:Rome Press Inc., 1988.
2006-02-10 02:27:44
Principles of Effective Research
Research and pursue truth might be meaningful, but it can't be all of their life. There are so many exciting things to do. Why do they stay in doing research?
It might be a answer.
Ref. Michael A. Nielsen, Principles of Effective Research
Fundamental principles
The fundamental principles of effective research are extremely similar to those for effectiveness in any other part of life. Although the principles are common sense, that doesn't mean they're common practice, nor does it mean that they're easy to internalize. Personally, I find it a constant battle to act in accord with these principles, a battle requiring ongoing reflection, rediscovery and renewed commitment.
Integrating research into the rest of your life
Research is, of course, only a part of life, and must be understood in relation to the rest of life. The foundation of effective research is a strong motivation or desire to do research. If research is not incredibly exciting, rewarding and enjoyable, at least some of the time, then why not do something else that is? For the purposes of this essay, I'll assume that you already have a strong desire to do research [*].
[*] People sometimes act or talk as though desire and motivation cannot be changed. Within limits, I think that's wrong, and we can mold our own motivations. But that's a subject for another essay.
Motivation and desire alone are not enough. You also need to have the rest of your life in order to be an effective researcher. Make sure you're fit. Look after your health. Spend high quality time with your family. Have fun. These things require a lot of thought and effort to get right. If you don't get them right, not only will your life as a whole be less good, your research will suffer. So get these things right, and make sure they're integrated with your research life.
As an example, I once spent three years co-authoring a technical book, and for the final eighteen months I concentrated on the book almost exclusively, to the neglect of my health, relationships, and other research. It is tempting to ask the question "Was the neglect worth the benefits?'' But that is the wrong question, for while the neglect paid short-term dividends in increased productivity, over the total period of writing the book I believe it probably cost me productivity, and it certainly did after the book was complete. So not only did I become less fit and healthy, and see my relationships suffer, the book took longer to complete than if I'd had my life in better order.
2006-02-09 20:20:49
Simplex Method
I still can't understand the simplex method; It's amazing! I must misunderstand or miss something!
Ref. Chvatal, Chapter 2, Linear Programming, Freeman, 1983.
In general, given a problem
Maximize $\sum^n_{j=1} c_jx_j$
subject to $\sum^n_{j=1} a_{ij}x_j\leq b_i$ ($i=1,2,\cdots,m$)
$x_j\geq 0$ ($j=1,2,\cdots,n$)
$x_{n+i}=b_i - \sum^n_{j=1} a_{ij}x_j$ ($i=1,2,\cdots,m$)
$z=\sum^n_{j=1}c_jx_j$.
Every solution of the set of equations comprising a dictionary must be also a solution of (2.13), and vice versa.(pp. 18, (2.13) means the math formulation of the problem.)
The equations of every dictionary must express $m$ of the variables $x_1, x_2,\cdots,x_{n+m}$ and the objective function $z$ in terms of the remaining $n$ variables.(pp. 19)
Books of LP are introducing much different terminologies, like basic, feasible solution, dictionary, feasible dictionary, and so on, but they doesn't tell me which terminologies are important and meaningful.They said, the variables in left-hand side are called basic, the variables in the right-hand side are called nonbasic, and the basic variables are said to constitute a basis; the formula for the leaving variable appears in the pivot row of the dictionary; the computational process of constructing the new dictionary is referred to as pivoting.
The simplex method is a procedure that build a dictionary; find a initial solution; pivoting (chose a nonbasic variable to become basic variable and build a new dictionary.), and then find out a better feasible solution.
In each pivoting, $z=\sum^n_{j=1}c_jx_j$. will become less flexible. (more and more terms will be with negative sign.) The constant term will enter the objective equation like $z=c+\sum^n_{j=1} d_jx_j$.(Note that the $c$ term in objective equation will restrict the feasible solution. Although each dictionary will be equivalent, but the optimal solution will be produced step by step because we will pick the pivoting variable from the objective equation for maximizing $z$.)
At beginning, I don't understand that how to produce the optimal solution since each dictionary are equivalent. The answer is in the objective equation. The change of objective equation will make us have less and less choice for improvement.
Ref. Chvatal, Chapter 3, Linear Programming, Freeman, 1983.
- If $b_i$ will not nonnegative, it might be without feasible solutions. (The author discussed it on pp.39-42.)
The trouble with an infeasible origin is twofold. First, it may not be clear that our problem has any feasible solutions at all. Second, even if a feasible solution is apparent, a feasible dictionary may not be.
The strategy is known as the two-phase simplex method. In the first phase, we set up and solve the auxiliary problem; if the optimal dictionary turns out to have the property (3.15) then we proceed to the second phase, solving the original problem itself.The property (3.15) is on pp. 41 of this book. i.e., $x_0$ is nonbasic, and so the value of $w$ is zero. This property about auxiliary problem is respect to the relation of auxiliary problem's solution and original problem's solution. A important key point is the policy to choose leaving variable of candidates: $x_0$ is prior. But I don't really understand its argument. Why does the original problem be infeasible, if the property, $x_0$ is basic and the value of $w$ is zero, held?
If no suitable variables to be chosen as entering variable, then it means current solution optimal. If candidates more than one, any one of them will OK!
We chose the leaving variable that is that basic variable whose non-negativity impose the most stringent upper bound on the increase of the entering variable.
Simplex iterations that do not change the basic solution are called degenerate. It is harmless but annoying.
Theorem 3.1 If the simplex method fails to terminate, then it must cycle.
Note that cycle must occur in the presence of degeneracy. Cycling is rare in LP so that regardless on most implementation. There are some ways to avoid cycling.-
perturbation method
lexicographic method
smallest-subscript rule
Every LP problem in the standard form has the following three properties:Simplex method is a clever and powerful method, but its procedure is not so clear. There are many pitfalls in each steps.If it has no optimal solution, then it is either infeasible or unbound. If it has a feasible solution, then it has a basic feasible solution. If it has an optimal solution, then it has a basic optimal solution.
2006-02-08 00:15:25
Methods and Techniques of Complex Systems Science: An Overview (Part I)
"Methods and Techniques of Complex Systems Science: An Overview," forthcoming as a chapter in Thomas S. Deisboeck and J. Yasha Kresh (eds.), Complex Systems Science in Biomedicine (Kluwer Academic) = nlin.AO/0307015. A 27,000 words summary of the tools people should use to study complex systems, covering statistical learning and data-mining, time series analysis, cellular automata, agent-based models, evaluation techniques and simulation, information theory and complexity measures, with 265 references (a personal record). I'm told the actual book should appear either in late 2004 or early 2005.(Information of this paper from auther's website.)
I chose this paper for presentation on Lab meeting. This field, complex systems science, seems most intersection of fields which I am interested in. I hope this paper can guide me to know what I need to learn. And this paper is for biomedical research, it satisfy the direction of our Lab.
Ref. [nlin/0307015] Methods and Techniques of Complex Systems Science: An Overview
In this chapter, I review the main methods and techniques of complex systems science. As a first step, I distinguish among the broad patterns which recur across complex systems, the topics complex systems science commonly studies, the tools employed, and the foundational science of complex systems. The focus of this chapter is overwhelmingly on the third heading, that of tools. These in turn divide, roughly, into tools for analyzing data, tools for constructing and evaluating models, and tools for measuring complexity. I discuss the principles of statistical learning and model selection; time series analysis; cellular automata; agent-based models; the evaluation of complex-systems models; information theory; and ways of measuring complexity. Throughout, I give only rough outlines of techniques, so that readers, confronted with new problems, will have a sense of which ones might be suitable, and which ones definitely are not.
The techniques of complex systems science:
-
For analyzing data
In this part, the popular techniques are statistical learning, data mining, time series analysis
For building and understanding modelsnonlinear dynamics, cellular automata, agent-based models
For measuring complexityinformation theory
Data-mining is useful. It can tell us what kinds of patterns are in the data, and so guide what our model-building.
-
Prediction is sometimes simply taken as regression. Regression analysis
We need to design loss function L to evaluate the performance of our prediction machine.
$L(y,f(x,\theta))$ means empirical loss for particular values $x\in X$ and $y\in Y$, and $L(Y,f(X,\theta))$ means the loss from the error in those prediction, here $\theta$ means the adjustment parameters of $L$.
The procedure to minimizes $L(y,f(x,\theta))$ to modify $\theta$ is called empirical risk minimization(ERM).
$L(y,f(x,\theta))$ is called training error, in-sample error or empirical error.
The error we can expect on new data, $E[L(\theta)]$, is called generalization error, out-of-sample error or true error.
$L(y,f(x,\theta))=E[L(\theta)]+\epsilon$, where $\epsilon$ means the sampling fluctuation. Refer to Note on Statistics, I thought that $E[L(\theta)]$ is a kind of point estimation(?), and $L(Y,f(X,\theta))$ is a parameter of population.
ill-posed problem : The Nature of Statistical Learning Theory by Vladimir N. Vapnik
Over-Fitting Problem : Three commonest forms to avoid over fitting.
-
Cross-Validation :
Produce many random training sets and test sets by many different ways to avoid over-fitting. Regularization or Penalization :
If one machine with fifty knobs, and another fits the data as well but has only a single knob, we should (the story goes) chose the latter --- because it's less flexible, the fact that it does well is good indication that it will still do well in the future.
Usually, regularization methods are justified by the idea that models can be more or less complex, and more complex ones are more liable to over-fit, all else being equal, so penalty terms should reflect complexity.
Capacity ControlThe core concept of Capacity Control seems a kind of approximation method. By studying the function of sample size and some other value like VC dimension, we can calculate that how in-sample error close to generalization error in specified $\theta$ to get the confidence interval of this approximation. By this function, we can calculate that how in-sample error close to generalization error in specified $\theta$
But it's to rough. I need to learn related knowledges.
Ref. VC dimension - Wikipedia, the free encyclopedia
2006-02-07 04:55:18
Google Scholar and NTHU
Google Scholar --> Scholar Preference --> Library Links
Key in "NTHU" and click the button of "Find Library".
After finishing above steps, the search results of Google Scholar with the label "SFX@NTHU" mean that those data can connect directly to the e-resources of NTHU library.