The Department of Computer Science hosts the world-famous computer scientist and inventor of Genetic Algorithms (GAs), Dr.
Genalytics Model 6.0 transforms large amounts of transaction data into predictive intelligence and uses genetic algorithms to automatically search and identify key combinations of time series variables found in any dataset.
Starting today, past and present IlliGAL lab members and affiliates will team up to blog on genetic algorithms, evolutionary computation, and related subjects. GECCO, I was re-reading Sam Mahfoud's paper titled "Population Sizing for Sharing Methods". Since niching is one of the key components of hierarchical and multiobjective genetic algorithms, the population-sizing paper deserves renewed attention.
Over a number of years, I've followed research in iGAs and some time ago, I started to collect information about market research and product development with the idea of founding a company to do just that using interactive GA technology.
Scalable Parallel Random Number Generators Library (SPRNG): An RNG library that combines various available random number generators and supports both serial and parallel platforms.
Mersenne Twister (MT): An RNG which has a period of 2^19937-1 with an assured 623-dimensional equidistribution property (thanks to Jaume for pointing me to MT). Experiments showed that a lower mixing rate increases both the population size needed and the convergence time. A Business Week article gives an overview of BBN's Boomerang project now fielded in Iraq to protect Allied troops by detecting and locating sniper fire. This story is fine as far as it goes, but as GA aficionados well know, the BBN effort was supported by a crack team of card-carrying genetic algorithmists.
This year's Genetic and Evolutionary Computation Conference (GECCO-2005) is 25-29 June 2005 in Washington DC. Those with artistic or creative talent should try their skills at designing the new SIGEVO logo as the competition ends February 7th. Martin Pelikan's book titled "Hierarchical Bayesian Optimization Algorithm: Toward a New Generation of Evolutionary Algorithms" is now available.
For more details regarding hBOA and to download the software, visit the hBOA page and Martin's webpage.
Lately, I have moved from WEKA to D2K, a data mining framework developed by the Automated Learning Group at the National Center for Supercomputing Applications. We all know that the computational complexity of GAs depends primarily on the population size and the number of generations. What I like about using the bisection method is that all parameter tweaking is done automatically (and thus I don't have to do it myself), because the remaining GA parameters do not influence time complexity as much (mostly by some constant factor, which is not so important when we talk about asymptotic upper bounds).
Of course, the above thinking is correct only if larger populations lead to better solutions (given enough generations we should all be able to believe this fact) and if smaller populations do not significantly increase the number of generations (in all cases I studied, any sufficiently large population size led to about the same number of generations, as can be expected for many variants of evolutionary algorithms based primarily on recombination). The Technology Review article also highlights the history of evolutionary computation, briefly mentioning the work of Holland, Goldberg, Koza, and others. Theory and Applications of Efficient Multi-Objective Evolutionary Algorithms (CS, Feng Chia University, 2004) was ranked number one among all CS dissertations in a national competition held under the auspices of the Institute of Information & Computing Machinery (IICM). The Oregon Daily Emerald reports that James Bean, noted operations researcher, genetic algorithmist, and inventor of the random keys technique for permutation and other combinatorial problems, is using genetic algorithms to help develop sustainable automobile technology in his new role as Dean of the Lundquist College of Business. The use of EDAs can help address two major issues in XCS: (1) knowledge extraction and (2) structure identification.
Toronto Globe and Mail reports that researchers at Laurentian University are using genetic algorithms to schedule mining of ore deposits. Science Daily posts a University of Michigan press release entitled Falling Prey to Machines? According to Holland, the problem with developing artificial intelligence through things like genetic algorithms is that researchers don't yet understand how to define what computer programs should be evolving toward. According to Holland, advances in software have not kept pace with the exponential improvements in hardware processing power, and there are many artificial intelligence problems that cannot be solved by simply performing more calculations.
Acovea (analysis of compiler options via evolutionary algorithm) is an open source project which utilizes GAs to tune the optimization flags for GCC to yield the fastest code specifically for your computer. Another reason for the current situation is that human lifespans are no longer a good match to the turnover in intellectual ideas. So, it is in this sense, that the acceptance of SIGEVO by ACM couldn't come at a better time, and should be seen as a very positive thing.
Of course, department heads when asked about tenure and promotion will tell you that each case is special and is scrutinized without regard for brand names.
It will take some time to know the overall effect, and indeed, joining a large bureaucratic organization like ACM will come with its share of constraints and costs. Sam Williams's recent article "Unnatural Selection" in Technology Review is getting some blog traction.
Another script I use very often is a Perl wrapper for gnuplot, the command-line plotting program. The script takes a variable number of data files as parameters and overlays them in the same plot. Don't forget, the new motherboards and processors drain a lot of power, so you need about 1.5 Amps per node. ATA Engineering announced the release of version 1.0 of the Attune package for test-analysis correlation and model updating. In beta testing, a key aerospace customer used Attune to carry out hundreds of real-time correlation iterations during modal testing of a launch vehicle payload, whereas prior to this they were limited to fewer than a dozen manual correlations a week. Additional information is available from the Attune product website or the full press release.
Finally, a third way of approaching animation synthesis was presented by Torsten Reil, from Natural Motion.
Now that marijuana has been legalized in at least 18 states, does this mean that drivers are exempt from DUI of marijuana? The answer will vary based on the driver's THC levels and the state in which the driver was pulled over.
Differing states have permissible amounts outlined to determine if the driver should be charged with a DUI.
One way to find accurate information about drugged driving, state limits, and the relevant penalties is to visit the state's official DMV or Department of Motor Vehicles.
If you've been recently arrested for DUI of marijuana, get in touch with a lawyer who specializes in this niche. By studying time series patterns, organizations can identify credit risks, anticipate customer attrition, and detect probable bankruptcies. In April 1994, a graduate student named Kate Sherwood (Ducky) put up the first IlliGAL web page.
In the last American election cycle, blogging grew as a force to rival the mainstream media. The population-sizing estimate, though derived for fitness sharing, also holds for other niching methods such as restricted tournament selection. Ever since Dawkins's famous Blind Watchmaker code, and Caldwell and Johnston's Faceprints work at ICGA 91, the idea of having user-driven subjective functions in place of predetermined objective functions has opened the door to evolving art, music, poetry, and more. My dreams of a great startup were dashed one semester when, during my genetic algorithms course, some students researching their class project uncovered a company that was already vigorously pursuing interactive GAs in marketing applications. A simple idea such as a round robin of small disjoint training sets greatly helps Pittsburgh classifier systems in the quest for generality. Not only is the math difficult, but the system also had to work in the cacophony of urban warfare, including echoes from shots -- and do it on the move. An article written by BBNers Hussain, Montana, Brinn & Cerys entitled Genetic algorithms for UGV navigation, sniper fire localization, and unit of action fuel distribution was presented last June at GECCO 2004 at the Workshop on Military and Security Applications of Evolutionary Computation (MSAEC-2004). In his book, Martin presents a principled method for designing and analyzing scalable genetic algorithms that can solve boundedly-difficult, hierarchically-decomposable problems in polynomial (usually sub-quadratic) time.
I know Martin and Jaume have also been using some of these tools in their data mining related papers. LIBSVM is an integrated software package for support vector classification (C-SVC, nu-SVC), regression (epsilon-SVR, nu-SVR), and distribution estimation (one-class SVM).
Given that we know the optimum, the number of generations can be determined by the GA itself---when the GA finds the optimum or the entire population converges to it (or gets close enough), we've got it. The basic idea is to first determine a reasonable initial interval (min,max) of population sizes where min is too small and max is too large.
To eliminate noise, the bisection can be run several times and the results of all bisection runs can be averaged (right now I use 10 rounds of bisection, each with 10 runs).
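The bisection idea can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the actual script used for the experiments: `run_once` is a hypothetical callback that runs the GA once with a given population size and reports whether it found the optimum, the initial interval is found by repeated doubling (one reasonable choice, not necessarily the one used here), and the stopping tolerance is arbitrary.

```python
from statistics import mean
from typing import Callable, Tuple


def make_reliability_test(run_once: Callable[[int], bool],
                          runs: int = 10,
                          required: int = 9) -> Callable[[int], bool]:
    """Wrap a single-run success test into a 'reliable convergence' test.

    run_once(pop_size) is a hypothetical callback: one GA run, True on success.
    A size counts as reliable if at least `required` of `runs` runs succeed.
    """
    def is_reliable(pop_size: int) -> bool:
        successes = sum(1 for _ in range(runs) if run_once(pop_size))
        return successes >= required
    return is_reliable


def find_initial_interval(is_reliable: Callable[[int], bool],
                          start: int = 10,
                          limit: int = 1_000_000) -> Tuple[int, int]:
    """Double the population size until it is reliable.

    Returns (lo, hi) where hi is reliable and lo was too small
    (lo == hi only if the starting size is already reliable).
    """
    lo, hi = start, start
    while not is_reliable(hi):
        if hi >= limit:
            raise RuntimeError("no reliable population size below the limit")
        lo, hi = hi, hi * 2
    return lo, hi


def bisect_population_size(is_reliable: Callable[[int], bool],
                           lo: int, hi: int,
                           tolerance: float = 0.1) -> int:
    """Shrink (lo, hi) by bisection; return the smallest reliable size found."""
    while hi - lo > max(1, tolerance * lo):
        mid = (lo + hi) // 2
        if is_reliable(mid):
            hi = mid      # mid is large enough, so search below it
        else:
            lo = mid      # mid is too small, so search above it
    return hi


def averaged_population_size(is_reliable: Callable[[int], bool],
                             rounds: int = 10,
                             tolerance: float = 0.1) -> float:
    """Repeat the whole bisection several times and average, to reduce noise."""
    sizes = []
    for _ in range(rounds):
        lo, hi = find_initial_interval(is_reliable)
        sizes.append(bisect_population_size(is_reliable, lo, hi, tolerance))
    return mean(sizes)
```

With a real GA, `run_once` would be stochastic, which is why the whole procedure is repeated and averaged rather than trusted after a single bisection.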

To determine whether or not the second condition is at least partially satisfied, one can pick one problem size (preferably a larger one) and test whether the overall complexity decreases for populations larger than the one determined by the bisection.
From jet engine fan blade design to minimizing power consumption of long-distance pipelines, the article cites several applications of genetic algorithms and genetic programming. Recently, such approaches have slowly been making their way into the learning classifier systems world.
Knowledge extraction addresses the issue of mining problem knowledge from the final solution developed by XCS. He told me that the problematic GA runs would converge to mediocre solutions, but different parts within those solutions were of good quality.
Thesis Symmetry in the Representation of an Optimization Problem and related papers, available as IlliGAL technical reports 2001020 and 2001030, are a good starting point for anyone interested in knowing more about symmetry and synchronization problems in optimization in general, and GAs in particular.
While hardware performance continues to double almost every year and a half, the doubling time for software performance is at least 20 years. I've been heavily involved in the process of this transformation, and so my views are not entirely unbiased (no one's views are entirely unbiased or so any good postmodernist would claim), but I think this move is good for ISGEC, good for ACM, good for the field of genetic and evolutionary computation, and good for computer science. First wave evolutionary computation researchers (and cyberneticists of all stripes) in the late 50s and 60s were just getting their sea legs under them when Minsky and Papert's premature and misleading hatchet job on neural networks appeared and took the wind out of cybernetic sails generally. With GECCO as an ACM conference proceedings, young faculty can safely put their work there without endlessly defending their choice. Department heads have told me that the decision to move from ISGEC to SIGEVO will make absolutely no difference to them. But, in the end, I believe the young people in the PhD pipeline will benefit immensely through higher probability of hiring, better chances to tenure, and improved prospects of funding as the once outlaw field of genetic algorithms and evolutionary computation comes in off the range, hangs up its six-shooter, and becomes a law-abiding denizen of the mainstream of computer science.
I have used it extensively to compute statistical tests, and I made a script to automate most of the testing process. It uses the Bonferroni correction to adjust the significance level of the tests for the multiple comparisons.
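For readers who have not met the Bonferroni correction, here is a minimal Python sketch of the idea. It is not the script mentioned above; the function names are made up for illustration. With m comparisons and an overall significance level alpha, each individual test is judged against alpha / m.

```python
from typing import List, Sequence


def bonferroni_alpha(alpha: float, num_tests: int) -> float:
    """Per-test significance level under the Bonferroni correction."""
    if num_tests < 1:
        raise ValueError("num_tests must be at least 1")
    return alpha / num_tests


def significant_after_bonferroni(p_values: Sequence[float],
                                 alpha: float = 0.05) -> List[bool]:
    """Flag which p-values remain significant after the correction."""
    threshold = bonferroni_alpha(alpha, len(p_values))
    return [p <= threshold for p in p_values]
```

For example, with three comparisons at an overall level of 0.05, each p-value has to be at most about 0.0167 to count as significant, so a p-value of 0.02 that would pass a single test no longer does.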
The script generates draft plots just to check how the results look, without the overhead of launching other programs, such as Matlab, that can generate fancier plots. We mostly use the cluster to run multiple GA runs to collect statistics, and also for testing the effect of different parameters on the scalability of the algorithms (for example, see Martin's blog). That means they produce a lot of heat, but nothing that a few well placed fans and a couple of ACs can't fix. He tackled the problem by using a set of controllers and running it through a set of genetic algorithms to develop the motion he wanted out of them.
This still gave an overall 10-20% speedup when the optimization choices it found were applied. Shortly thereafter, I proposed applying genetic algorithms to gas pipeline optimization and rule learning, did so, and went on to take my first teaching job at the University of Alabama.
And that thirst led me to try difference equations, Markov chains, transform methods, anything that I could get my hands on, to help me do a better job in GA analysis and design. For example, we might create a model for convergence time under selection alone, combine this with a model for mixing under crossover alone, and consider the pairwise effect as a dimensionless ratio of the two times.
Subsequently I met company executive Rob Frasca in another context, and got the chance to tell him how Affinova was the company that broke my entrepreneurial heart. Like supersonic airplanes, bullets create shock waves -- mini-sonic booms -- as they speed through the air. These efforts are the result of longstanding GA work at BBN initiated by Dave Davis (now of NuTech Solutions) and carried on by Dave Montana and other hardworking GA types.
Recently, I have noticed more than 20 different projects using this framework.
Using an intuitive graphical editor, complicated analysis and visualization tasks are rapidly deployed by simple drag & drop.
I have been using it in some of my recent research, and if you are interested in such areas, I definitely recommend taking a look at it. A "good" population size should be the one that ensures reliable convergence to the optimum (where "reliable" may be defined any way we want, for instance, that the algorithm finds the optimum successfully in 9 out of 10 independent runs). Thereafter, he successfully applies carefully designed MOEAs to problems in production planning and design of nearest-neighbor classifiers. Bean was on the faculty of the Department of Industrial and Operations Engineering (IOE) of the University of Michigan for many years, where he also held the position of Associate Dean for Academic Affairs.
The extracted knowledge may not only be used for further data mining, but may actually be fed back into the system, giving it further competence in solving problems in which dependent features, that is, building blocks, need to be processed effectively. So I suggested to him that there might be synchronization issues with the problematic instances and that using a niching method might be the solution. No mention is made of who the researchers are or what department, college, or lab they come from. Defining an equivalent test of fitness for targeting intelligence as an evolutionary goal for machines, however, has been elusive.

Finally, the rise of symbolic AI and the subsequent cybernetics winter made it academically disreputable to persist in the "folly" of genetic algorithms and evolutionary computation, but persist the field did, and we are here today because of the courage of a small group who swam against the intellectual currents of the 60s, 70s, and 80s.
The result is large numbers of powerful faculty in charge with ideas in mind that are more than a little behind the times.
In a busy world, trusted brands allow buyers to get quality products at low risk of error and low search costs. These optimizations seek to reduce memory or power usage, or to shorten execution time.
