简体   繁体   English

设计神经网络进行销售预测

[英]design a neural network for sales prediction

I'm developing a distributedsystem in which a plan to use a neural network for sales prediction. 我正在开发一个分布式系统,其中计划使用神经网络进行销售预测。

A small description of the system: the system combine the workflow of modules such as Pharmacy, Patient (electric health record) and a Doctor module. 系统的简要说明:该系统结合了诸如药房,患者(电子病历)和医生模块等模块的工作流程。

the problem is: I have many information related to the sales of medicine and i'm a little bit confused in designing the neural network. 问题是:我有很多与药品销售有关的信息,在设计神经网络时我有些困惑。

My current design : Inputs (according to the data that can be taken from the database): 我当前的设计:输入(根据可以从数据库中获取的数据):

  1. Date: the date of selling the medicine to a customer (YYYY/MM/DD format). 日期:向客户出售药品的日期(YYYY / MM / DD格式)。
  2. Patient Age Category: a value that correspond to the age category of the patient ex: (1 -> 12: child, 13 -> 30: Young …..) 患者年龄类别:与患者年龄类别相对应的值,例如:(1-> 12:儿童,13-> 30:年轻…..)
  3. Patient Gender. 病人性别。
  4. Medicine ID: a value that correspond to the medicine. 药品ID:与药品相对应的值。
  5. Disease ID: a value that correspond to the disease which was discovered by the doctor who generate the prescription. 疾病ID:与产生处方的医生发现的疾病相对应的值。
  6. Medicine Unit Cost: a value that correspond to the cost of purchasing a medicine. 药品单位成本:与购买药品的成本相对应的值。
  7. Medicine Unit Sold: a value that corresponds to the cost of selling a medicine. 药品销售单位:与药品销售成本相对应的价值。
  8. Pharmacy Address ID : a value that corresponds to the address of the pharmacy. 药房地址ID:对应于药房地址的值。
  9. Season ID : a value that corresponds to the season of selling the medicine ex(summer, winter , … etc). 季节编号:一个值,对应于该药品的销售季节(夏季,冬季等)。

Outputs : 输出:

  1. Quantity : A Value that correspond to the quantity of the medicine that will be sold. 数量:与将要出售的药品数量相对应的值。
  2. Profit : A Value that correspond to the profit amount of selling the previous quantity. 利润:与出售先前数量的利润金额相对应的值。

the question is : I'm not sure if this design is meaningful, any better suggestion? 问题是:我不确定这种设计是否有意义,还有更好的建议吗?

and what type of networks should I use to implement this design .. I'm planning to use multi-layer recurrent network ... It's a good choice or there is a better model? 以及应该使用哪种类型的网络来实现此设计..我计划使用多层递归网络...这是一个不错的选择,或者有一个更好的模型?

note: I'm planning to implement the network using c# with "AForge.NET Framework". 注意:我计划使用带有“ AForge.NET Framework”的c#来实现网络。

hope this description is clear and simple, and sorry for my poor language. 希望此说明简洁明了,对于我的语言不好,我深表歉意。

IDs and categories make a poor choice of actual inputs, because they are not continuous and there is little meaning in their relative magnitudes. ID和类别对实际输入的选择不佳,因为它们不连续并且相对大小意义不大。 What IDs may be good for is to create separate networks for different categories, but, given the large number of IDs and categories you've defined here, this means you have a very large number of separate networks, requiring a large pool of training data, since it will be heavily diluted. ID可能适合于为不同类别创建单独的网络,但是,鉴于您在此处定义的ID和类别数量众多,这意味着您拥有大量单独的网络,需要大量的训练数据,因为它将被严重稀释。 Binary categories (like gender) might work, but anything with more than one category will probably not yield good results. 二元类别(如性别)可能会起作用,但是任何具有多个类别的内容都可能不会产生良好的结果。

Be very careful with neural networks, as, given a big enough network, you can create something that appears to give predictions, but that is really meaningless outside the training set. 对于神经网络要非常小心,因为有了足够大的网络,您可以创建看起来可以做出预测的东西,但是在训练集之外这毫无意义。 Be sure you have a large validation set that does not take part in the training. 确保您拥有不参加培训的大型验证集。

I question whether the date (in absolute terms) adds any value to the model. 我怀疑日期(绝对值)是否为模型增加了任何价值。 If you had many years of data, it might yield some meaning, but a measure like what quarter is the date in, is it a weekend or a weekday, or is it within n number of days from a major holiday (etc.) might be much more powerful. 如果您有很多年的数据,它可能会产生一些含义,但是诸如日期是哪个季度,是周末还是工作日,或者距重大假期不超过n天(例如)之类的量度可能更强大。

I don't know your immediate field, so I don't have any opinions on the network topology. 我不知道您的直接领域,因此对网络拓扑没有任何意见。 If you look up some papers, there may be review articles geared towards your area of study discussing what the trends are. 如果您查找一些论文,可能会有针对您的研究领域的评论文章,讨论趋势。

This appears to be missing some bits of info. 这似乎缺少一些信息。

In order to predict the future you need a baseline to go from. 为了预测未来,您需要一个基准。 For example, on a given prescription how likely is it that someone is going to refill it? 例如,在给定的处方上,某人要补充它的可能性有多大? Is that likelihood tied to any of your other metrics (age group, gender, date sold)? 这种可能性是否与您的其他任何其他指标(年龄组,性别,销售日期)相关联?

Also season only has bearing given the location of the patient(s), and probably only for certain types of drugs (I'm thinking cold/allergy/flu). 同样,季节仅取决于患者的位置,并且可能仅针对某些类型的药物(我在考虑感冒/过敏/流感)。 Further in order to provide any real seasonal accuracy you'd have to bring in data for that season such as allergen levels, etc. 此外,为了提供任何实际的季节性准确性,您必须输入该季节的数据,例如过敏原水平等。

Next, the cost of the medicine and profit expectations are really related only to the actual medicine and aren't necessary to predict sales likelihood. 其次,药品的成本和利润预期实际上仅与实际药品相关,而无需预测销售可能性。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM